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HUMAN NEUTRALIZING MONOCLONAL ANTIBODIES 
TO HUMAN IMMUNODEFICIENCY VIRUS 

Technical Field 
5 The present invention relates generally to the 

field of immunology and specifically to human 
monoclonal antibodies which bind and neutralize human 
immunodeficiency virus (HIV) . 

10 Background 

1. HIV Immunotherapy 

HIV is the focus of intense studies as it is 
the causative agent for acquired immunodeficiency 
syndrome (AIDS) . Immunotherapeutic methods are one of 

15 several approaches to prevention, cure or remediation 
of HIV infection and HIV- induced diseases. 
Specifically, the use of neutralizing antibodies in 
passive immunotherapies is of central importance to 
the present invention. 

20 Passive immunization of HIV-1 infected humans 

using human sera containing polyclonal antibodies 
immunoreactive with HIV has been reported. See for 
example, Jackson et al., liancet . September 17:647-652, 
(1988); Karpas et al., Pr oc . Na t 1 . Acad . Sc i . , USA , 

25 87:7613-7616 (1990). 

Numerous groups have reported the preparation of 
human monoclonal antibodies that neutralize HIV 
isolates in vitro . The described antibodies typically 
have immunospecif icities for epitopes on the HIV 

30 glycoprotein gpl20 or the related external surface 
envelope glycoprotein gpl20 or the transmembrane 
glycoprotein gp41. See, for example Levy, Micro. 
Rev. . 57:183-289 (1993); Karwowska et al., Aids 
Research and Human Retroviruses . 8:1099-1106 (1992); 

35 Takeda et al., J. Clin. Invest. . 89:1952-1957 (1992); 
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Tilley et al. , Aids Research and Human Retroviruses , 
8:461-467 (1992); Laman et al., J. Virol. . 66:1823- 
1831 (1992); Thali et al., J. Virol. . 65:6188-6193 
(1991); Ho et al., Proc. Natl. Acad. Sci. USA , 
5 88:8949-8952 (1991); D'Souza et al., AIDS . 5:1061- 

1070 (1991); Tilley et al., Res. Virol. , 142:247-259 
(1991); Broliden et al., Immunol. . 73:371-376 (1991); 
Matour et al., J. Immunol. , 146:4325-4332 (1991); and 
Gorny et al., Proc. Natl. Acad. Sci., USA , 88:3238- 

10 3242 (1991) . 

To date, none of the reported human monoclonal 
antibodies have been shown to be effective in passive 
immunization therapies. Further, as monoclonal 
antibodies, they all each react with an individual 

15 epitope on the HIV envelope glycoprotein, gpl20 or 
gpl60. The epitope against which an effective 
neutralizing antibody immunoreacts has not been 
identified. 

There continues to be a need to develop human 
20 monoclonal antibody preparations with significant HIV 
neutralization activity. In addition, there is a need 
for monoclonal antibodies immunoreactive with 
additional and diverse neutralizing epitopes on HIV 
gpl20 and gp4l in view of recent studies suggesting 
25 that gpl20 and gp41 are involved in both binding of 
the HIV virus to the cell as well as in postbinding 
events including envelope shedding and cleavage. See, 
for review, Levy, Micro. Rev. . 57:183-289 (1993). 
Additional (new) epitope specificities are required 
30 because, upon passive immunization, the administered 
patient can produce an immune response against the 
administered antibody, thereby inactivating the 
particular therapeutic antibody. 

35 2 . Human Monoclonal Antibodies Produced From 
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Combinatorial Phacremid Libraries 
The use of filamentous phage display vectors f 
referred to as phagemids, has been repeatedly shown to 
allow the efficient preparation of large libraries of 
5 monoclonal antibodies having diverse and novel 
immunospecif icities. The technology uses a 
filamentous phage coat protein membrane anchor domain 
as a means for linking gene-product and gene during 
the assembly stage of filamentous phage replication, 

10 and has been used for the cloning and expression of 

antibodies from combinatorial libraries. Kang et al., 
Proc. Natl. Acad. Sci.. USA . 88:4363-4366 (1991). 
Combinatorial libraries of antibodies have been 
produced using both the cpVIII membrane anchor (Kang 

15 et al., supra ) and the cpIII membrane anchor. Barbas 
et al., Proc. Natl. Acad. Sci., USA . 88:7978-7982 
(1991). 

The diversity of a filamentous phage-based 
combinatorial antibody library can be increased by 
20 shuffling of the heavy and light chain genes (Kang et 
■al., Proc. Natl. Acad. Sci.. USA , 88:11120-11123 

(1991) ), by altering the CDR3 regions of the cloned 
heavy chain genes of the library (Barbas et al., Proc. 
Natl. Acad. Sci.. USA . 89:4457-4461 (1992)), and by 

25 introducing random mutations into the library by 

error-prone polymerase chain reactions (PCR) [Gram et 
al., Proc. Natl.. Acad. Sci.. USA . 89:3576-3580 

(1992) ]. 

Filamentous phage display vectors have also been 
30 utilized to produce human monoclonal antibodies 

immunoreactive with hepatitis B virus (HBV) or HIV 
antigens. See, for example Zebedee et al., Proc. 
Natl. Acad. Sci. . USA . 89:3175-3179 (1992); and Burton 
et al., Proc. Natl. Acad. Sci.. USA . 88:10134-10137 
35 (1991), respectively. None of the previously > 
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described human monoclonal antibodies produced by 
phagemid vectors that are immunoreactive with HIV have 
been shown to neutralize HIV. 

5 Brief Description of the Invention 

Methods have now been discovered using the 
phagemid vectors to identify and isolate from 
combinatorial libraries human monoclonal antibodies 
that neutralize HIV, and allow the rapid preparation 

10 of large numbers of neutralizing antibodies of 
completely human derivation. The identified 
neutralizing antibodies define new epitopes on the HIV 
gpl20 and gp41 glycoproteins, thereby increasing the 
availability of new immunotherapeutic human monoclonal 

15 antibodies . 

The invention provides human monoclonal 
antibodies that neutralize HIV, and also provides cell 
lines used to produce these monoclonal antibodies. 
Also provided are amino acid sequences which 

20 confer neutralization function to the antigen binding 
domain of a monoclonal antibody, and which can be used 
immunogenically to identify other antibodies that 
specifically bind and neutralize HIV. The monoclonal 
antibodies of the invention find particular utility as 

25 reagents for the diagnosis and immunotherapy of HIV- 
induced disease. 

A major advantage of the monoclonal antibodies of 
the invention derives from the fact that they are 
encoded by a human polynucleotide sequence. Thus, jji 

30 vivo use of the monoclonal antibodies of the invention 
for diagnosis and immunotherapy of HIV- induced disease 
greatly reduces the problems of significant host 
immune response to the passively administered 
antibodies which is a problem commonly encountered 

35 when monoclonal antibodies of xenogeneic or chimeric 



derivation are utilized. 

In one embodiment, the invention contemplates a 
human monoclonal antibody capable of immunoreacting 
with human immunodeficiency virus (HIV) glycoprotein 
gpl20 and neutralizing HIV* A preferred human 
monoclonal antibody has the binding specificity of a 
monoclonal antibody comprising a heavy chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID 
NOs 66, 67, 68, 70, 72, 73, 74, 75, 78 and 97. 
Another preferred human monoclonal antibody has the 
binding specificity of a monoclonal antibody 
comprising a light chain immunoglobulin variable 
region amino acid residue sequence selected from the 
group consisting of SEQ ID NOs 95, 96, 97, 98, 101, 
102, 103, 104, 105, 107, 110, 115, 118, 121, 122, 124 
and 132. 

In a further embodiment, the invention 
contemplates a human monoclonal antibody capable of 
immunoreacting with human immunodeficiency virus (HIV) 
glycoprotein gp41 and neutralizing HIV. A preferred 
human monoclonal antibody has the binding specificity 
of a monoclonal antibody comprising a heavy chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID 
NOs 142, 143, 144, 145 and 146. Another preferred 
human monoclonal antibody has the binding specificity 
of a monoclonal antibody comprising a light chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID 
NOs 147, 148, 149, 150 and 151. 

In another embodiment, the invention describes a 
polynucleotide sequence encoding a heavy or light 
chain immunoglobulin variable region amino acid 
residue sequence portion of a human monoclonal 
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antibody of this invention. Also contemplated are DNA 
expression vectors containing the polynucleotide, and 
host cells containing the vectors and polynucleotides 
of the invention. 
5 The invention also contemplates a method of 

detecting human immunodeficiency virus (HIV) 
comprising contacting a sample suspected of containing 
HIV with a diagnostically effective amount of the 
monoclonal antibody of this invention, and determining 

10 whether the monoclonal antibody immunoreacts with the 
sample. The method can be practiced in vitro or in 
vivo , and may include a variety of methods for 
determining the presence of an immunoreaction product. 
In another embodiment, the invention describes a 

15 method for providing passive immunotherapy to human 
immunodeficiency virus (HIV) disease in a human, 
comprising administering to the human an 
immunotherapeutically effective amount of the 
monoclonal antibody of this invention. The 

20 administration can be provided prophylactically, and 
by a parenteral administration. Pharmaceutical 
compositions containing one or more of the different 
human monoclonal antibodies are described for use in 
the therapeutic methods of the invention. 

25 

Brief Description of the Drawings 

In the drawings forming a portion of this 
disclosure : 

Figure 1 illustrates the sequence of the 

30 double-stranded synthetic DNA inserted into Lambda Zap 
to produce a Lambda Hc2 expression vector. The 
preparation of the double-stranded synthetic DNA 
insert is described in Example la2) . The various 
features required for this vector to express the 

35 V H -coding DNA homologs include the Shine-Dalgarno 
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ribosome binding site, a leader sequence to direct the 
expressed protein to the periplasm as described by 
Mouva et al., J. Biol. Chem. , 255:27, 1980, and 
various restriction enzyme sites used to operatively 
5 link the V H homologs to the expression vector. The V H 
expression vector sequence also contains a short 
nucleic acid sequence that codes for amino acids 
typically found in variable regions heavy chain (V H 
backbone) . This V H backbone is just upstream and in 

10 the proper reading as the V H DNA homologs that are 
operatively linked into the Xho I and Spe I cloning 
sites. The sequences of the top and bottom strands of 
the double-stranded synthetic DNA insert are listed 
respectively in SEQ ID NO 1 and SEQ ID NO 2. The ten 

15 amino acid sequence comprising the decapeptide tag is 
listed in SEQ ID NO 5. The synthetic DNA insert is 
directionally ligated into Lambda Zap II digested with 
the restriction enzymes Not 1 and Xho I to form Lambda 
Hc2 expression vector. 

20 Figure 2 illustrates the major features of the 

bacterial expression vector Lambda Hc2 (V H expression 
vector) . The orientation of the insert in Lambda Zap 
II is shown. The V H DNA homologs are inserted into 
the Xho I and Spe I cloning sites. The read through 

25 transcription produces the decapeptide epitope (tag) 
that is located just 3 1 of the cloning site. The 
amino acid residue sequence of the decapeptide tag and 
the Pel B leader sequence /spacer are respectively 
listed in SEQ ID NO 5 and 6. 

30 Figure 3 illustrates the sequence of the double- 

stranded synthetic DNA inserted into Lambda Zap to 
produce a Lambda Lc2 expression vector. The various 
features required for this vector to express the V L - 
coding DNA homologs are described in Figure 1. The 

35 V L -coding DNA homologs are operatively linked into the 
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Lc2 sequence at the Sac I and Xho I restriction sites. 
The sequences of the top and bottom strands of the ; 
double-stranded synthetic DNA insert are listed 
respectively in SEQ ID NO 3 and SEQ ID NO 4. The 
5 synthetic DNA insert is directionally li gated into , 
Lambda Zap II digested with the restriction enzymes 
Sac I and Not I to form Lambda Lc2 expression vector. 

Figure 4 illustrates the major features of the 
bacterial expression vector Lc2 (V L expression 

10 vector) . The synthetic DNA sequence from Figure 3 is 
shown at the top along with the Lac Z promoter from 
Lambda Zap II. The orientation of the insert in 
Lambda Zap II is shown. The V L DNA homologs are 
inserted into the Sac I and Xho I cloning sites. The ; v 

15 amino acid residue sequence of the Pel B leader 
sequence/ spacer is listed in SEQ ID NO 7. 

Figure 5 illustrates the dicistronic expression 
vector, pComb, in the form of a phagemid expression 
vector. 

20 Figure 6 illustrates the neutralization of HIV-1 

by recombinant Fabs. The same supemate preparations 
were used in p24 and syncytia assays. The figures 
indicate neutralization titers. Refer to Example 3 
for details of the assay procedures and discussion of 

25 the results. The ELISA titers and Fab concentrations 

were determined as described in Example 2b. 

Figure 7 illustrates the relative affinities of 
Fab fragments for gpl20 (IIIB) as illustrated by 
inhibition ELISA performed as described in Example 

30 2b6) . Fabs 27, 6, 29, 2 and 3 are all prototype 

members of the different groups discussed in Example 
4. Loop 2 is an Fab fragment selected from the same 
library as the other Fabs but which recognizes the V3 
loop. The data is plotted as the percentage of 

35 maximum binding on the Y-axis against increasing 
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concentrations (10~ 11 M to 10" 7 M) of soluble gp!20 on 
the X-axis. 

Figure 8 illustrates the soluble CD4 competition 
with Fab fragments for gpl20 (IIIB) . P4D10 and loop2 
5 are controls* P4D10 is a mouse monoclonal antibody 
reacting with the V3 loop of gpl20 (IIIB). The data, 
discussed in Example 2b6) , is plotted as described in 
Figure 7, 

Figure 9 illustrates the neutralization of HIV by 

10 purified Fabs prepared as described in Example 3. The 
results shown are derived from the syncytia assay 
using the MN strain. The data is plotted as percent 
of inhibition of binding on the Y-axis against 
increasing Fab concentrations [0.1 to greater than 10 

15 micrograms/milliliter (Mg/ml) ] on the X-axis. 

Figure 10 illustrates the amino acid residue 
sequences of variable heavy (V H ) domains of Fabs 
binding to gpl20. Seven distinct groups have been 
identified as described in Example 4 based on sequence 

20 homology. Identity with the first sequence in a group 
is indicated by dots. The Fab clone names are 
indicated in the left hand column. The corresponding 
SEQ ID NOs are indicated in the right hand column. 
The sequenced regions from right to left are framework 

25 region 1 (FR1) , complementary determining region 1 
(CDR1), framework region 2 (FR2) , complementary 
determining region 2 (CDR2), framework region 3 (FR3) , 
complementary determining region 3 (CDR3) , and 
framework region 4 (FR4). The five amino-terminal 

30 residue sequence beginning with LEQ arises from the 
VHla while the 5 amino-terminal residue sequence 
beginning with LEE arises from the VH3a primers. The 
bll and b29 sequences are very similar to the b3 group 
and could be argued to be intraclonal variants within 

35 that group; they are placed in their own group because 
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of differences at the V-D and D-J interface. 

Figure 11 illustrates the amino acid residue 
sequences of variable light (V L ) domains of Fabs 
binding to gp!20. Refer to Figure 10 for the 
5 description of the figure and to Example 4 for 
analysis of the sequences* 

Figure 12 illustrates the amino acid residue 
sequences of V L domains from Fabs binding to gpl20 and 
generated by shuffling the heavy chain from clone bl2 

10 against a library of light chains (H12-LCn Fabs) as 
described in Example 5. Note that the new V L 
sequences have designated clone numbers that do not 
relate to those numbers from the original library. 
The unique sequences are listed in the Sequence 

15 Listing from SEQ ID NO 114 to 122. The new V L domain 
sequences are compared to that of the original clone 
bl2 V L sequence. 

Figure 13 illustrates the amino acid residue 
sequences of V H domains from Fabs binding to gpl20 and 

20* generated by shuffling the light chain from clone b!2 
against a library of heavy chains (L12-HCn Fabs) as 
described in Example 5. Note that the new V H 
sequences have designated clone numbers that do not 
relate to those numbers from the original library. 

25 The unique sequences are listed in the Sequence 

Listing from SEQ ID NO 123 to 132. The new V H domain 
sequences are compared to that of the original clone 
bl2 V H sequence. 

Figure 14 illustrates, in two figures, Figure 14A 

30 and 14B, plasmid maps of the heavy (pTACOlH) and light 
chain (pTCOl) replicon-compatible chain-shuffling 
vectors, respectively. Both plasmids are very similar 
in the section containing the promoter and the cloning 
site. Abbreviations: tacPO, tac promoter/operon; 5 

35 histidine amino acid residue tag (histidine) 5-tail; 
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fllG, intergenic region of fl-phage; stu, stuff er 
fragment ready for in-frame replacement by light and 
heavy chain, respectively; cat, chloramphenicol 
transferase gene; bla, b- lactamase gene; ori, origin 
5 of replication. The map is drawn approximately to 
scale. 

Figure 15 illustrates the nucleotide sequences of 
the binary shuffling vectors in two Figures, 15A and 
15B. The construction and use of the vectors is 

10 described in Example 6. In Figure 15A, the 

double-stranded nucleotide sequence of the multiple 
cloning site in light chain vector, pTCOl, is shown. 
The sequences of the top and bottom nucleotide base 
strands are listed respectively in SEQ ID NO 8 and SEQ 

15 ID NO 9. The amino acid residue sequence comprising 
the pelB leader ending in the Sac I restriction site 
is listed in SEQ ID NO 10. In Figure 15B, the 
nucleotide sequence of the multiple cloning site in 
heavy chain vector, pTACOlH, is shown. The sequences 

20 of the top and bottom nucleotide base strands are 

listed respectively in SEQ ID NO 11 and SEQ ID NO 12. 
The amino acid residue sequence comprising the pelB 
leader ending in the Xho I restriction site is listed 
as SEQ ID NO 13. The amino acid residue sequence 

25 comprising the histidine tail is listed in SEQ ID NO 
14. Relevant restriction sites are underlined, tac 
promoter and ribosome binding site (rbs) are indicated 
by boxes. 

Figure 16 illustrates the complete set of 
30 directed crosses between heavy and light chains of all 
Fab fragments isolated from the original library by 
panning with gpl60 (IIIB) (bl-b27) , gpl20 (IIIB) 
(B8-B35) , gpl20 (SF2) (s4-s8) , and the loop peptide 
(p35) assayed by ELISA against IIIB gpl20 as described 
35 in Example 6. Heavy chains are listed horizontally 
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and light chains are listed vertically. Clones are 
sorted according to the grouping established in 
Example 4. Different groups are separated by 
horizontal and vertical lines. A at the 
intersection of a particular heavy chain and light 
chain signifies a clear negative (a signal of 3 times 
background or less) for that particular cross, a "+ w 
shows a clear positive comparable to the original 
heavy and light chain combination, and a "w" denotes 
an intermediate value in the ELISA. the HCp35/ 

LCp35 combination is negative when gp!20 (IIIB) is 
used, but positive when assayed with gpl20 (IIIB) . 
Identical chains carry the same identifier (either *, 
5, §, or ¥). 

Figure 17 illustrates the affinity of 
antibody-antigen interaction for b!2 heavy chain 
crosses with light chains from all pannings analyzed 
by competitive ELISA using soluble IIIB gpl20 as 
competing antigen as described in Example 6. The data 
is plotted as the percentage of maximum binding on the 
Y-axis against increasing concentrations of soluble 
gpl20 (IIIB) (10' 12 M to 10" 7 M) on the X-axis* 

Figure 18 illustrates the amino acid residue 
sequences of variable heavy (V H ) domains of Fabs 
binding to gp41. The Fab clone names are indicated in 
the left hand column. The heavy chain sequences of 
the five Fabs individually designated DL 41 19, DO 41 
11, GL 41 1, MT 41 12 and SS 41 8 have been assigned 
the respective SEQ ID NOs 142, 143, 144, 145 and 146. 
The sequenced regions from right to left are framework 
region 1 (FR1) , complementary determining region 1 
(CDR1) , framework region 2 (FR2) , complementary 
determining region 2 (CDR2) , framework region 3 (FR3) , 
complementary determining region 3 (CDR3), and 
framework region 4 (FR4) . 
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Figure 19 illustrates the amino acid residue 
sequences of variable light (V L ) domains of Fabs 
binding to gp4l. Refer to Figure 18 for the 
description of the figure. The light chain sequences 
5 of the five Fabs individually designated DL 41 19, DO 
41 11, GL411, MT41 12 arid SS 41 8 have been 
assigned the respective SEQ ID NOs 147, 148, 149, 150 
and 151. 

10 Detailed Description of the Invention 
A. Definitions 

Amino Acid Residue : An amino acid formed 
upon chemical digestion (hydrolysis) of a polypeptide 
at its peptide linkages. The amino acid residues : ; 

15 described herein are preferably in the "L M isomeric 

form. However, residues in the "D" isomeric form can 
be substituted for any L-amino acid residue, as long 
as the desired functional property is retained by the 
polypeptide. NH 2 refers to the free amino group 

20 present at the amino terminus of a polypeptide. COOH 
refers to the free carboxy group present at the 
carboxy terminus of a polypeptide. In keeping with 
standard polypeptide nomenclature (described in J. 
Biol. Chem. , 243:3552-59 (1969) and adopted at 37 CFR 

25 §1.822(b)(2)), abbreviations for amino acid residues 
are shown in the following Table of Correspondence: . 

TABLE OF CORRESPONDENCE 

SYMBOL AMINO ACID 

30 1-Letter 3 -Letter 

Y Tyr tyrosine 

G Gly glycine 

F Phe phenylalanine 

M Met methionine 

35 A Ala alanine 
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s 


Ser 


serine 


I 


lie 


isoleucine 


L 


Leu 


leucine 


T 


Thr 


threonine 


V 


Val 


valine 


P 


Pro 


proline 


K 


Lys 


lysine 


H 


His 


histidine 


Q 


Gin 


glutamine 


E 


Glu 


glutamic acid 


Z 


Glx 


Glu and/ or Gin 


W 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


B 


Asx 


Asn and /or Asp 


C 


Cys 


cysteine 


X 


Xaa 


Unknown or other 



It should be noted that all amino acid residue 

20 sequences represented herein by formulae have a left- 
to-right orientation in the conventional direction of - 
amino terminus to carboxy terminus* In addition, the 
phrase "amino acid residue" is broadly defined to 
. include the amino acids listed in the Table of 

25 Correspondence and modified and unusual amino acids, 
such as those listed in 37 CFR 1.822(b)(4), and 
incorporated herein by reference. Furthermore, it 
should be noted that a dash at the beginning or end of 
an amino acid residue sequence indicates a peptide 

30 bond to a further sequence of one or more amino acid 

residues or a covalent bond to an amino-terminal group 
such as NH 2 or acetyl or to a carboxy-terminal group 
such as COOH. 

Recombinant DNA frDNA) molecule : A DNA molecule 

35 produced by operatively linking two DNA segments. 
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Thus, a recombinant DNA molecule is a hybrid DNA 
molecule comprising at least two nucleotide sequences 
not normally found together in nature. rDNA 1 s not 
having a common biological origin, i.e., 
evolutionarily different, are said to be 
"heterologous" . 

Vector : A rDNA molecule capable of autonomous 
replication in a cell and to which a DNA segment, 
e.g., gene or polynucleotide, can be operatively 
linked so as to bring about replication of the 
attached segment. Vectors capable of directing the 
expression of genes encoding for one or more 
polypeptides are referred to herein as "expression 
vectors". Particularly important vectors allow 
cloning of cDNA (complementary DNA) from mRNAs 
produced using reverse transcriptase. 

Receptor ; A receptor is a molecule, such as a 
protein, glycoprotein and the like, that can 
specifically (non-randomly) bind to another molecule. 

Antibody ; The term antibody in its various 
grammatical forms is used herein to refer to 
immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules, i.e., molecules 
that contain an antibody combining site or paratope. 
Exemplary antibody molecules are intact immunoglobulin 
molecules, substantially intact immunoglobulin 
molecules and portions of an immunoglobulin molecule, 
including those portions known in the art as Fab, 
Fab 1 , F(ab*) 2 and F(v) . 

Antibody Combining Site ; An antibody combining 
site is that structural portion of an antibody 
molecule comprised of a heavy and light chain variable 
and hypervariable regions that specifically binds 
(immunoreacts with) an antigen. The term immunoreact 
in its various forms means specific binding between an 
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antigenic determinant-containing molecule and a 
molecule containing an antibody combining site such as 
a whole antibody molecule or a portion thereof. 

Monoclonal Antibody ; A monoclonal antibody in 
5 its various grammatical forms refers to a population 

of antibody molecules that contain only one species of 
antibody combining site capable of immunoreacting with 
a particular epitope. A monoclonal antibody thus 
typically displays a single binding affinity for any 

10 epitope with which it immunoreacts . A monoclonal 

antibody may therefore contain an antibody molecule 
having a plurality of antibody combining sites, each 
immunospecif ic for a different epitope, e.g., a 
bispecific monoclonal antibody. Although historically 

15 a monoclonal antibody was produced by immortalization 
of a clonally pure immunoglobulin secreting cell line, 
a monoclonally pure population of antibody molecules 
can also be prepared by the methods of the present 
invention. 

20 Fusion Polypeptide : A polypeptide comprised of 

at least two polypeptides and a linking sequence to 
operatively link the two polypeptides . into one 
continuous polypeptide. The two polypeptides linked 
in a fusion polypeptide are typically derived from two 

25 independent sources, and therefore a fusion 

polypeptide comprises two linked polypeptides not 
normally found linked in nature. 

Upstream : In the direction opposite to the 
direction of DNA transcription, and therefore going 

30 from 5* to 3' on the non-coding. strand, or 3 1 to 5' on 
the mRNA. 

Downstream : Further along a DNA sequence in the 
direction of sequence transcription or read out, that 
is traveling in a 3'- to 5 1 -direction along the non- 
35 coding strand of the DNA or 5'- to 3 1 -direction along 
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the RNA transcript. 

Cistron : Sequence of nucleotides in a DNA 
molecule coding for an amino acid residue sequence and 
including upstream and downstream DNA expression 
5 control elements. 

Leader Polypeptide : A short length of amino acid 
sequence at the amino end of a polypeptide, which 
carries or directs the polypeptide through the inner 
membrane and so ensures its eventual secretion into 
10 the periplasmic space and perhaps beyond. The leader 
sequence peptide is commonly removed before the 
polypeptide becomes active. 

Reading Frame ; Particular sequence of contiguous 
nucleotide triplets (codons) employed in translation. 
15 The reading frame depends on the location of the 
translation initiation codon. 

B. Human Monop^ona! Antibodies 

The present invention relates to human mono- 

20 clonal antibodies which are specific for, and 

neutralize human immunodeficiency virus (HIV) . In a 
preferred embodiment of the invention , human 
monoclonal antibodies are disclosed which are capable 
of binding epitopic polypeptide sequences in 

25 glycoprotein gpl20 of HIV. A further preferred 

embodiment are human monoclonal antibodies capable of 
binding epitopic polypeptide sequences in glycoprotein 
gp 41 of HIV. Also disclosed is an antibody having a 
specified amino acid sequence, which sequence confers 

30 the ability to bind a specific epitope and to 

neutralize HIV when the virus is bound by these 
antibodies. A human monoclonal antibody with a 
claimed specificity, and like human monoclonal 
antibodies with like specificity, are useful in the 

35 diagnosis and immunotherapy of HIV- induced disease. 
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The term ^HIV- induced disease" means any disease 
caused, directly or indirectly, by HIV. An example of 
a HIV-induced disease is acquired autoimmunodef iciency 
syndrome (AIDS) , and any of the numerous conditions 
5 associated generally with AIDS which are caused by HIV 
infection. 

Thus, in one aspect, the present invention is 
directed to human monoclonal antibodies which are 
reactive with a HIV neutralization site and cell lines 

10 which produce such antibodies. The isolation of cell 
lines producing monoclonal antibodies of the invention 
is described in great detail further herein, and can 
be accomplished using the phagemid vector library 
methods described herein, and using routine screening 

15 techniques which permit determination of the 

elementary immunoreaction and neutralization patterns 
of the monoclonal antibody of interest. Thus, if a 
human monoclonal antibody being tested binds and 
neutralizes HIV in a manner similar to a human 

20 monoclonal antibody produced by the cell lines of the 
invention then the tested antibody is considered 
equivalent to an antibody of the invention. 

It is also possible to determine, without undue 
experimentation, if a human monoclonal antibody has 

25 the same (i.e., equivalent) specificity as a human 

monoclonal antibody of this invention by ascertaining 
whether the former prevents the latter from binding to 
HIV. If the human monoclonal antibody being tested 
competes with the human monoclonal antibody of the 

30 invention, as shown by a decrease in binding by the 

human monoclonal antibody of the invention in standard 
competition assays for binding to a solid phase 
antigen, for example to gpl20, then it is likely that 
the two monoclonal antibodies bind to the same, or a 

35 closely related, epitope. 
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Still another way to determine whether a human 
monoclonal antibody has the specificity of a human 
monoclonal antibody of the invention is to pre- 
incubate the human monoclonal antibody of the 
5 invention with HIV with which it is normally reactive, 
and then add the human monoclonal antibody being 
tested to determine if the human monoclonal antibody 
being tested is inhibited in its ability to bind HIV. 
If the human monoclonal antibody being tested is 

10 inhibited then, in all likelihood, it has the same, or 
functionally equivalent, epitopic specificity as the 
monoclonal antibody of the invention. Screening of 
human monoclonal antibodies of the invention, can be 
also carried out utilizing HIV neutralization assays 

15 and determining whether the monoclonal antibody 
neutralizes HIV. 

The ability to neutralize HIV at one or more 
stages of virus infection is a desirable quality of a 
human monoclonal antibody of the present invention. 

20 Virus neutralization can be measured by a variety of 

in vitro and in vivo methodologies. Exemplary methods 
described herein for determining the capacity for 
neutralization are the in vitro assays that measure 
inhibition of HIV- induced syncytia formation, plaque 

25 assays and assays that measure the inhibition of 

output of core p24 antigen from a cell infected with 
HIV. 

As shown herein, the immunospecif icity of a human 
monoclonal antibody of this invention can be directed 

30 to epitopes that are shared across serotypes and/ or 

strains of HIV, or can be specific for a single strain 
of HIV, depending upon the epitope. Thus, a preferred 
human monoclonal antibody can immunoreact with HIV-1, 
HIV-2, or both, and can immunoreact with one or more 

35 of the HIV-1 strains IIIB, MN, RF, SF-2, Z2, 26, CDC4, 
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ELI and the like strains. 

The immunospecif icity of an antibody, its HIV- 
neutralizing capacity, and the attendant affinity the 
antibody exhibits for the epitope, are defined by the 
5 epitope with which the antibody immunoreacts . The 

epitope specificity is defined at least in part by the 
amino acid residue sequence of the variable region of 
the heavy chain of the immunoglobulin the antibody, 
and in part by the light chain variable region amino 

10 acid residue sequence. Preferred human monoclonal 
antibodies immunoreact with the CD4 binding site of 
glycoprotein gpl20. 

A preferred human monoclonal antibody of this 
invention has the binding specificity of a monoclonal 

15 antibody comprising a heavy chain immunoglobulin 

variable region amino acid residue sequence selected 
from the group of sequences consisting of SEQ ID NOs 
66, 67, 68, 70, 72, 73, 74, 75, 78 and 97, and 
conservative substitutions thereof. 

20 Another preferred human monoclonal antibody of 

this invention' has the binding specificity of a 
monoclonal antibody having a light chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group of sequences 

25 consisting of SEQ ID NOs 95, 96, 97, 98, 101, 102, 

103, 104 , 105, 107, 110, 115, 118, 121, 122, 124 and 
132, and conservative substitutions thereof. 

Further preferred human monoclonal antibodies 
immunoreact with the CD4 binding site of glycoprotein 

30 gp41. A preferred human monoclonal antibody of this 
invention has the binding specificity of a monoclonal 
antibody comprising a heavy chain immunoglobulin 
variable region amino acid residue sequence selected 
from the group of sequences consisting of SEQ ID NOs 

35 142, 143, 144, 145, and 146 and conservative 
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substitutions thereof. 

Another preferred human monoclonal antibody of 
this invention has the gp41 binding specificity of a 
monoclonal antibody having a light chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group of sequences 
consisting of SEQ ID NOs 147, 148, 149, 150, and 151 
and conservative substitutions thereof. 

As shown by the present teachings and using the 
combinatorial library shuffling and screening methods, 
one can identify new heavy and light chain pairs (H:L) 
that function as a HIV-neutralizing monoclonal 
antibody. In particular, one can shuffle a known 
heavy chain, derived from an HIV-neutralizing human 
monoclonal antibody, with a library of light chains to 
identify new H:L pairs that form a functional antibody 
according to the present invention. Similarly, one 
can shuffle a known light chain, derived from an HIV- 
neutralizing human monoclonal antibody, with a library 
of heavy chains to identify new H:L pairs that form a 
functional antibody according to the present 
invention. 

Particularly preferred human monoclonal 
antibodies are those having the gpl20 immunoreaction 
(binding) specificity of a monoclonal antibody having 
heavy and light chain immunoglobulin variable region 
amino acid residue sequences in pairs (H:L) selected 
from the group consisting of SEQ ID NOs 66:95, 67:96, 
72:102, 66:97, 73:107, 74:103, 70:101, 68:98, 75:104, 
72:105, 78:110, 66:118, 66:122, 66:121, 66:115, 
97:124, 97:132 and 66:98, and conservative 
substitutions thereof. The designation of two SEQ ID 
NOs with a colon, e.g., 66:95, is to connote a H:L 
pair formed by the heavy and light chain, 
respectively, amino acid residue sequences shown in 
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SEQ ID NO 66 and SEQ ID NO 95, respectively. 

Further preferred human monoclonal antibodies are 
those having the gp41 immunoreaction (binding) 
specificity of a monoclonal antibody having heavy and 
light chain immunoglobulin variable region amino acid 
residue sequences in pairs (H:L) selected from the 
group consisting of SEQ ID NOs 142:147, 143:148, 
144:149, 145:150, and 146:151, and conservative 
substitutions thereof* 

Particularly preferred are human monoclonal 
antibodies having the binding specificity of the 
monoclonal antibody produced by the B. coli 
microorganisms deposited with the ATCC, as described 
further herein. 

Particularly preferred are human monoclonal 
antibodies having the binding specificity of the 
monoclonal antibodies produced by the E. coli 
microorganisms designated ATCC 69078, 69079 and 69080. 
By "having the binding specificity" is meant 
equivalent monoclonal antibodies which exhibit the 
same or similar immunoreaction and neutralization 
properties, and which compete for binding to an HIV 
antigen. Preferred are the human monoclonal 
antibodies produced by ATCC 69078, 69079 and 69080. 

The term "conservative variation" as used herein 
denotes the replacement of an amino acid residue by 
another, biologically similar residue. Examples of 
conservative variations include the substitution of 
one hydrophobic residue such as iso leucine, valine, 
leucine or methionine for another, or the substitution 
of one polar residue for another, such as the 
substitution of arginine for lysine, glutamic for 
aspartic acids, or glutamine for asparagine, and the 
like. The term "conservative variation" also includes 
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the use of a substituted amino acid in place of an 
unsubstituted parent amino acid provided that 
antibodies having the substituted polypeptide also 
neutralize HIV. Analogously, another preferred 
5 embodiment of the invention relates to polynucleotides 
which encode the above noted heavy and/ or light chain 
polypeptides and to polynucleotide sequences which are 
complementary to these polynucleotide sequences. 
Complementary polynucleotide sequences include those 

10 sequences which hybridize to the polynucleotide 
sequences of the invention under stringent 
hybridization conditions. 

By using the human monoclonal antibodies of the 
invention, it is now possible to produce anti- 

15 idiotypic antibodies which can be used to screen human 
monoclonal antibodies to identify whether the antibody 
has the same binding specificity as a human monoclonal 
antibody of the invention and also used for active 
immunization (Herlyn et al., Science . 232:100 (1986)). 

20 Such anti-idiotypic antibodies can be produced using 
well-known hybridoma techniques (Kohler et al., 
Nature . 256:495 (1975)). Ah anti-idiotypic antibody 
is an antibody which recognizes unique determinants 
present on the human monoclonal antibody produced by 

25 the cell line of interest. These determinants are 

located in the hypervariable region of the antibody. 
It is this region which binds to a given epitope and, 
thus, is responsible for the specificity of the 
antibody. An anti-idiotypic antibody can be prepared 

30 by immunizing an animal with the monoclonal antibody 
of interest. The immunized animal will recognize and 
respond to the idiotypic determinants of the 
immunizing antibody and produce an antibody to these 
idiotypic determinants. By using the anti-idiotypic 

35 antibodies of the immunized animal, which are specific 
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for the human monoclonal antibody of the invention 
produced by a cell line which was used to immunize the ' 
second animal, it is now possible to identify other 
clones with the same idiotype as the antibody of the 
5 hybridoma used for immunization. Idiotypic identity 
between human monoclonal antibodies of two cell lines 
demonstrates that the two monoclonal antibodies are 
the same with respect to their recognition of the same 
epitopic determinant. Thus, by using anti-idiotypic 

10 antibodies, it is possible to identify other 

hybridomas expressing monoclonal antibodies having the 
same epitopic specificity. 

It is also possible to use the anti-idiotype 
technology to produce monoclonal antibodies which ; ; Vy; ^ 

15 mimic an epitope. For example, an anti-idiotypic 
monoclonal antibody made to a first monoclonal 
antibody will have a binding domain in the 
hypervariable region which is the "image" of the 
epitope bound by the first monoclonal antibody. Thus, 

20 the anti-idiotypic monoclonal antibody can be used for 
immunization, since the anti-idiotype monoclonal 
antibody binding domain effectively acts as an 
antigen. 

In one preferred embodiment, the invention 
25 contemplates a truncated immunoglobulin molecule 
comprising a Fab fragment derived from a human 
monoclonal antibody of this invention. The Fab 
fragment, lacking Fc receptor, is soluble, and affords 
therapeutic advantages in serum half life, and 
30 diagnostic advantages in modes of using the soluble 
Fab fragment. The preparation of a soluble Fab 
fragment is generally known in the immunological arts 
and can be accomplished by a variety of methods. A 
preferred method of producing a soluble Fab fragment 
35 is described herein. 
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C. Immunotherapeutic Methods and Compositions 
The human monoclonal antibodies can also be 
used immunotherapeutically for HIV disease. The term 
"immunotherapeutically" or "immunotherapy* 1 as used 
herein in conjunction with the monoclonal antibodies 
of the invention denotes both prophylactic as veil as 
therapeutic administration. Thus, the monoclonal 
antibodies can be administered to high-risk patients 
in order to lessen the likelihood and/or severity of 
HIV- induced disease, administered to patients already 
evidencing active HIV infection, or administered to 
patients at risk of HIV infection. 

1. Therapeutic Compositions 

The present invention therefore 
contemplates therapeutic compositions useful for 
practicing the therapeutic methods described herein. 
Therapeutic compositions of the present invention 
contain a physiologically tolerable carrier together 
with at least one species of human monoclonal antibody 
as described herein, dissolved or dispersed therein as 
an active ingredient. In a preferred embodiment, the 
therapeutic composition is not immunogenic when 
administered to a human patient for therapeutic 
purposes, unless that purpose is to induce an immune 
response, as described elsewhere herein. 

As used herein, the terms "pharmaceutical ly 
acceptable", "physiologically tolerable" and 
grammatical variations thereof, as they refer to 
compositions, carriers, diluents and reagents, are 
used interchangeably and represent that the materials 
are capable of administration to or upon a human 
without the production of undesirable physiological 
effects such as nausea, dizziness, gastric upset and 
the like. 



The preparation of a pharmacological composition 
that contains active ingredients dissolved or 
dispersed therein is well understood in the art. 
Typically such compositions are prepared as sterile 
injectables either as liquid solutions or suspensions, 
aqueous or non-aqueous, however, solid forms suitable 
for solution, or suspensions, in liquid prior to use 
can also be prepared. The preparation can also be 
emulsified. 

The active ingredient can be mixed with 
excipients which are pharmaceutically acceptable and 
compatible with the active ingredient and in amounts 
suitable for use in the therapeutic methods described 
herein. Suitable excipients are, for example, water, 
saline, dextrose, glycerol, ethanol or the like and 
combinations thereof. In addition, if desired, the 
composition can contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH 
buffering agents and the like which enhance the 
effectiveness of the active ingredient. 

The therapeutic composition of the present 
invention can include pharmaceutically acceptable 
salts of the components therein. Pharmaceutically 
acceptable salts include the acid addition salts 
(formed with the free amino groups of the polypeptide) 
that are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such 
organic acids as acetic, tartaric, mandelic and the 
like. Salts formed with the free carboxyl groups can 
also be derived from inorganic bases such as, for 
example, sodium, potassium, ammonium, calcium or 
ferric hydroxides, and such organic bases as 
isopropylamine, trimethylamine, 2-ethylamino ethanol, 
histidine, procaine and the like. 

Physiologically tolerable carriers are well known 
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in the art. Exemplary of liquid carriers are sterile 
aqueous solutions that contain no materials in 
addition to the active ingredients and water, or 
contain a buffer such as sodium phosphate at 
physiological pH value, physiological saline or both, 
such as phosphate-buffered saline. Still further, 
aqueous carriers can contain more than one buffer 
salt, as well as salts such as sodium and potassium 
chlorides, dextrose, propylene glycol, polyethylene 
glycol and other solutes. 

Liquid compositions can also contain liquid 
phases in addition to and to the exclusion of water. 
Exemplary of such additional liquid phases are 
glycerin, vegetable oils such as cottonseed oil, 
organic esters such as ethyl oleate, and water-oil 
emulsions. 

A therapeutic composition contains an HIV- 
neutralizing of a human monoclonal antibody of the 
present invention, typically an amount of at least 0.1 
weight percent of antibody per weight of total 
therapeutic composition. A weight percent is a ratio 
by weight of antibody to total composition. Thus, for 
example, 0.1 weight percent is 0.1 grams of antibody 
per 100 grains of total composition. 

2. Therapeutic Methods 

In view of the demonstrated HIV 
neutralizing ability of the human monoclonal 
antibodies of the present invention, the present 
disclosure provides for a method for neutralizing HIV 
in vitro or in vivo . The method comprises contacting 
a sample believed to contain HIV with a composition 
comprising a therapeutically effective amount of a 
human monoclonal antibody of this invention. 

For in vivo modalities, the method comprises 
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administering to the patient a therapeutically 
effective amount of a physiologically tolerable 
composition containing a human monoclonal antibody of 
the invention. Thus, the present invention describes 
5 in one embodiment a method for providing passive 

immunotherapy to HIV disease in a human comprising 
administering to the human an immunotherapeutically 
effective amount of the monoclonal antibody of this 
invention. 

10 A representative patient for practicing the 

present passive immunotherapeutic methods is any human 
exhibiting symptoms of HIV- induced disease, including 
AIDS or related conditions believed to be caused by 
HIV infection, and humans at risk of HIV infection. 

15 Patients at risk of infection by HIV include babies of 
HIV-infected pregnant mothers, recipients of 
transfusions known to contain HIV, users of HIV 
contaminated needles, individuals who have 
participated in high risk sexual activities with known 

20 HIV-infected individuals, and the like risk 
situations. 

In one embodiment, the passive immunization 
method comprises administering a composition 
comprising more than one species of human monoclonal 

25 antibody of this invention, preferably directed to 
non-competing epitopes or directed to distinct 
serotypes or strains of HIV, as to afford increased 
effectiveness of the passive immunotherapy. 

A therapeutically (immunotherapeutically) 

30 effective amount of a human monoclonal antibody is a 

predetermined amount calculated to achieve the desired 
effect, i.e., to neutralize the HIV present in the 
sample or in the patient, and thereby decrease the 
amount of detectable HIV in the sample or patient. In 

35 the case of in vivo therapies, an effective amount can 
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be measured by improvements in one or more symptoms 
associated with HIV- induced disease occurring in the 
patient, or by serological decreases in HIV antigens. ; 
Thus, the dosage ranges for the administration of 
5 the monoclonal antibodies of the invention are those 
large enough to produce the desired effect in which 
the symptoms of the HIV disease are ameliorated or the 
likelihood of infection decreased. The dosage should 
not be so large as to cause adverse side effects, such 

10 as hyperviscosity syndromes, pulmonary edema, 

congestive heart failure, and the like. Generally, 
the dosage will vary with the age, condition, sex and 
extent of the disease in the patient and can be 
determined by one of skill in the art. 

15 The dosage can be adjusted by the individual 

physician in the event of any complication. 

A therapeutically effective amount of an antibody 
of this invention is typically an amount of antibody 
such that when administered in a physiologically 

20 tolerable composition is sufficient to achieve a 

plasma concentration of from about 0.1 microgram (ug) 
per milliliter (ml) to about 100 ug/ml, preferably 
from about 1 ug/ml to about 5 ug/ml, and usually about 
5 ug/ml. Stated differently, the dosage can vary from 

25 about 0.1 mg/kg to about 300 mg/kg, preferably from 
about 0.2 mg/kg to about 200 mg/kg, most preferably 
from about 0.5 mg/kg to about 20 mg/kg, in one or more 
dose administrations daily, for one or several days. 
The human monoclonal antibodies of the invention 

30 can be administered parenterally by injection or by 
gradual infusion over time. Although the HIV 
infection is typically systemic and therefore most 
often treated by intravenous administration of 
therapeutic compositions, other tissues and delivery 

35 means are contemplated where there is a likelihood 
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that the tissue targeted contains infectious HIV. 
Thus, human monoclonal antibodies of the invention can 
be administered intravenously, intraperitoneally, 
intramuscularly , subcutaneous ly , intracavity , 
5 transdermal ly, and can be delivered by peristaltic 
means . 

The therapeutic compositions containing a human 
monoclonal antibody of this invention are 
conventionally administered intravenously, as by 

10 injection of a unit dose, for example* The term "unit 
dose" when used in reference to a therapeutic 
composition of the present invention refers to 
physically discrete units suitable as unitary dosage 
for the subject, each unit containing a predetermined 

15 quantity of active material calculated to produce the 
desired therapeutic effect in association with the 
required diluent; i.e., carrier, or vehicle. 

The compositions are administered in a manner 
compatible with the dosage formulation, and in a 

20 therapeutically effective amount. The quantity to be 
administered depends on the subject to be treated, 
capacity of the subject's system to utilize the active 
ingredient, and degree of therapeutic effect desired. 
Precise amounts of active ingredient required to be 

25 administered depend on the judgement of the 

practitioner and are peculiar to each individual. 
However, suitable dosage ranges for systemic 
application are disclosed herein and depend on the 
route of administration. Suitable regimes for 

30 administration are also variable, but are typified by 
an initial administration followed by repeated doses 
at one or more hour intervals by a subsequent 
injection or other administration. Alternatively, 
continuous intravenous infusion sufficient to maintain 

35 concentrations in the blood in the ranges specified 
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for in vivo therapies are contemplated. 

As an aid to the administration of effective 
amounts of a monoclonal antibody, a diagnostic method 
for detecting a monoclonal antibody in the subject's 
5 blood is useful to characterize the fate of the 
administered therapeutic composition. 

The invention also relates to a method for 
preparing a medicament or pharmaceutical composition 
comprising the human monoclonal antibodies of the 
10 invention, the medicament being used for immunotherapy 

of HIV disease. 

D. Diagnostic Assay Methods 

The present invention contemplates various 

15 assay methods for determining the presence, and 
preferably amount, of HIV in a sample such as a 
biological fluid or tissue sample using a human 
monoclonal antibody of this invention as an 
immunochemical reagent to form an immunoreaction 

20 product whose amount relates, either directly or 
indirectly, to the amount of HIV in the sample. 

Those skilled in the art will understand that 
there are numerous well known clinical diagnostic 
chemistry procedures in which an immunochemical 

25 reagent of this invention can be used to form an 

immunoreaction product whose amount relates to the 
amount of HIV present in a body sample. Thus, while 
exemplary assay methods are described .herein, the 
invention is not so limited. 

30 Various heterogenous and homogeneous protocols, 

either competitive or noncompetitive, can be employed 
in performing an assay method of this invention. 
Examples of types of immunoassays which can utilize 
monoclonal antibodies of the invention are competitive 

35 and non-competitive immunoassays in either a direct or 
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indirect format* Examples of such immunoassays are 
the radioimmunoassay (RIA) and the sandwich 
(immunometric) assay. Detection of the antigens using 
the monoclonal antibodies of the invention can be done 
5 utilizing immunoassays which are run in either the 
forward, reverse, or simultaneous modes, including 
immunohistochemical assays on physiological samples. 
Those of skill in the art will know, or can readily 
discern, other immunoassay formats without undue 

10 experimentation . 

The monoclonal antibodies of the invention can be 
bound to many different carriers and used to detect 
the presence of HIV. Examples of well-known carriers 
include glass, polystyrene, polypropylene, 

15 polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, agaroses and 
magnetite. The nature of the carrier can be either 
soluble or insoluble for purposes of the invention. 
Those skilled in the art will know of other suitable 

20 carriers for binding monoclonal antibodies, or will be 
able to ascertain such, using routine experimentation. 

There are many different labels and methods of 
labeling known to those of ordinary skill in the art. 
Examples of the types of labels which can be used in 

25 the present invention include enzymes, radioisotopes, 
fluorescent compounds, colloidal metals, 
chemiluminescent compounds, and bio-luminescent com- 
pounds. Those of ordinary skill in the art will know 
of other suitable labels for binding to the monoclonal 

30 antibodies of the invention, or will be able to 
ascertain such, using routine experimentation. 
Furthermore, the binding of these labels to the 
monoclonal antibodies of the invention can be done 
using standard techniques common to those of ordinary 

35 skill in the art. 
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For purposes of the invention, HIV may be 
detected by the monoclonal antibodies of the invention 
when present in samples of biological fluids and 
tissues. Any sample containing a detectable amount of 
5 HIV can be used. A sample can be a liquid such as 

urine, saliva, cerebrospinal fluid, blood, serum and 
the like, or a solid or semi-solid such as tissues, 
feces, and the like, or, alternatively, a solid tissue 
such as those commonly used in histological diagnosis. 

10 Another labeling technique which may result in 

greater sensitivity consists of coupling the 
antibodies to low molecular weight haptens. These 
haptens can then be specifically detected by means of 
a second reaction. For example, it is common to use 

15 haptens such as biotin, which reacts with avidin, or 
dinitrophenol, pyridoxal, or fluorescein, which can 
react with specific anti-hapten antibodies. 

The monoclonal antibodies of the invention are 
suited for use in vitro , for example, in immunoassays 

20 in which they can be utilized in liquid phase or bound 

to a solid phase carrier for the detection of HIV in 
samples, as described above. The monoclonal 
antibodies in these immunoassays can be detectably 
labeled in various ways for in vitro use. 

25 In using the human monoclonal antibodies of the 

invention for the in vivo detection of antigen, the 
detectably labeled human monoclonal antibody is given 
in a dose which is diagnostically effective. The term 
"diagnostically effective" means that the amount of 

30 detectably labeled human monoclonal antibody is 
administered in sufficient quantity to enable 
detection of the site having the HIV antigen for which 
the monoclonal antibodies are specific. 

The concentration of detectably labeled human 

35 monoclonal antibody which is administered should be 



sufficient such that the binding to HIV is detectable 
compared to the background. Further,! it is desirable ? 
that the detectably labeled monoclonal antibody be 
rapidly cleared from the circulatory system in order 
to give the best target-to-background signal ratio. 

As a rule, the dosage of detectably labeled human 
monoclonal antibody for in vivo diagnosis will vary 
depending on such factors as age, sex, and extent of 
disease of the individual. The dosage of human 
monoclonal antibody can vary from about 0.01 mg/m 2 to 
about 500 mg/m 2 , preferably 0.1 mg/m 2 to about 200 
mg/m 2 , most preferably about 0.1 mg/m 2 . to about 10 
mg/m 2 . Such dosages may vary, for example, depending 
on whether multiple injections are given, tissue, and / 
other factors known to those of skill in the art. 

For in vivo diagnostic imaging, the type of 
detection instrument available is a major factor in 
selecting a given radioisotope. The radioisotope 
chosen must have a type of decay which is detectable 
for a given type of instrument. Still another 
important factor in selecting a radioisotope for iji 
vivo diagnosis is that the half -life of the 
radioisotope be long enough so that it is still 
detectable at the time of maximum uptake by the 
target, but short enough so that deleterious radiation 
with respect to the host is minimized. Ideally, a 
radioisotope used for in vivo imaging will lack a 
particle emission, but produce a large number of 
photons in the 140-250 keV range, which may be readily 
detected by conventional gamma cameras. 

For in vivo diagnosis radioisotopes may be bound 
to immunoglobulin either directly or indirectly by 
using an intermediate functional group, intermediate 
functional groups which often are used to bind 
radioisotopes which exist as metallic ions to 
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immunoglobulins are the bifunctional chelating agents 

such as diethylenetriaminepentacetic acid (DTPA) and 

ethylenediaminetetraacetic acid (EDTA) and similar 

molecules. Typical examples of metallic ions which 

5 can be bound to the monoclonal antibodies of the 

invention are 111 In, ^Ru, 67 Ga, "Ga, ^As, w Zr, and 
zoi T1 ^ 

The monoclonal antibodies of the invention can 
also be labeled with a paramagnetic isotope for 
10 purposes of in vivo diagnosis, as in magnetic 

resonance imaging (MRI) or electron spin resonance 
(ESR) « in general, any conventional method for 
visualizing diagnostic imaging can be utilized. 
Usually gamma and positron emitting radioisotopes are 
15 used for camera imaging and paramagnetic isotopes for 
MRI. Elements which are particularly useful in such 
techniques include 157 Gd, 55 Mn, 162 Dy r ^Cr, and ^e. 

The human monoclonal antibodies of the invention 
can be used in vitro and in vivo to monitor the course 
20 of HIV disease therapy. Thus, for example, by 

measuring the increase or decrease in the number of 
cells infected with HIV or changes in the 
concentration of HIV present in the body or in various 
body fluids, it would be possible to determine whether 
25 a particular therapeutic regimen aimed at ameliorating 
the HIV disease is effective, 

E. Diagnostic Systems 

The present invention also describes a 
30 diagnostic system, preferably in kit form, for 

assaying for the presence of HIV in a sample according 
to the diagnostic methods described herein. A 
diagnostic system includes, in an amount sufficient to 
v ,., perform at least one assay, a subject human monoclonal 
35 antibody, as a separately packaged reagent. 
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In another embodiment, a diagnostic system is 
contemplated for assaying for the presence of an anti- 
HIV monoclonal antibody in a body fluid sample such as 
for monitoring the fate of therapeutically 
administered antibody. The system includes, in an 
amount sufficient for at least one assay, a subject 
antibody as a control reagent, and preferably a 
preselected amount of HIV antigen, each as separately 
packaged immunochemical reagents. 

Instructions for use of the packaged reagent are 
also typically included. 

"Instructions for use" typically include a 
tangible expression describing the reagent 
concentration or at least one assay method parameter 
such as the relative amounts of reagent and sample to 
be admixed, maintenance time periods for reagent/ 
sample admixtures, temperature, buffer conditions and 
the like. 

In embodiments for detecting HIV in a body fluid, 
a diagnostic system of the present invention can 
include a label or indicating means capable of 
signaling the formation of an immunocomplex containing 
a human monoclonal antibody of the present invention. 

The word "complex" as used herein refers to the 
product of a specif ic binding reaction such as an 
antibody-antigen reaction. Exemplary complexes are 
immunoreaction products. 

As used herein, the terms "label", and "indicating 
means" in their various grammatical forms refer to 
single atoms and molecules that are either directly or 
indirectly involved in the production of a detectable 
signal to indicate the presence of a complex. Any 
label or indicating means can be linked to or 
incorporated in an expressed protein, polypeptide, or 
antibody molecule that is part of an antibody or 
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monoclonal antibody composition of the present 
invention, or used separately, and those atoms or 
molecules can be used alone or in conjunction with 
additional reagents. Such labels are themselves well- 
5 known in clinical diagnostic chemistry and constitute 
a part of this invention only insofar as they are 
utilized with otherwise novel proteins methods and/or 
systems . 

The labeling means can be a fluorescent labeling 
10 agent that chemically binds to antibodies or antigens 
without denaturing them to form a fluorochrome (dye) 
that is a useful immuno fluorescent tracer. Suitable 
fluorescent labeling agents are f luorochromes such as 
fluorescein isocyanate (FIC) , fluorescein 
15 isothiocyanate (FITC) , 5-dimethylamine-l- 
naphthalenesulf onyl chloride (DANSC) , 
tetramethylrhodamine isothiocyanate (TRITC) , 
lissamine, rhodamine 8200 sulphonyl chloride (KB 200 
SC) and the like. A description of immunofluorescence 
20 analysis techniques is found in DeLuca, 

"Immunofluorescence Analysis", in A ntibody As a Too l. 
Marchalonis et al., eds., John Wiley & Sons, Ltd., pp. 
189-231 (1982) , which is incorporated herein by 
reference. 

25 In preferred embodiments, the indicating group is 

an enzyme, such as horseradish peroxidase (HRP) , 
glucose oxidase, or the like. In such cases where the 
principal indicating group is an enzyme such as HRP or 
glucose oxidase, additional reagents are required to 

30 visualize the fact that a receptor-ligand complex 

(immunoreactant) has formed. Such additional reagents 
for HRP include hydrogen peroxide and an oxidation dye 
precursor such as diaminobenzidine. An additional 
reagent useful with glucose oxidase is 2 , 2 » -amino-di- 

35 (3-ethyl-benzthiazoline-G-sulfonic acid) (ABTS) . 
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Radioactive elements are also useful labeling 
agents and are used illustratively herein* An 
exemplary radiolabeling agent is a radioactive element 
that produces gamma ray emissions. Elements which 
5 themselves emit gamma rays, such as 124 I, 125 I, 128 I, 132 I 
and 51 Cr represent one class of gamma ray emission- 
producing radioactive element indicating groups. 
Particularly preferred is 12S I. Another group of 
useful labeling means are those elements such as "c, 
10 18 F, 15 0 and 13 N which themselves emit positrons. The 
positrons so emitted produce gamma rays upon 
encounters with electrons present in the animal's 
body. Also useful is a beta emitter, such 111 indium 
of 3 H. 

15 The linking of labels, i.e., labeling of, 

polypeptides and proteins is well known in the art. 
For instance, antibody molecules produced by a 
hybridoma can be labeled by metabolic incorporation of 
radioisotope-containing amino acids provided as a 

20 component in the culture medium. See, for example, 
Galfre et al., Meth. Bnzvmol. , 73:3-46 (1981) . The 
techniques of protein conjugation or coupling through 
activated functional groups are particularly 
applicable. See, for example, Aurameas et al., Scand. 

25 J. Immunol. . Vol. 8 Suppl. 7:7-23 (1978), Rodwell et 

al., Biotech. . 3:889-894 (1984), and U.S. Pat. No. 
4,493,795. 

The diagnostic systems can also include, 
preferably as a separate package, a specific binding 

30 agent. A "specific binding agent" is a molecular 
entity capable of selectively binding a reagent 
species of the present invention or a complex 
containing such a species, but is not itself a 
polypeptide or antibody molecule composition of the 

35 present invention. Exemplary specific binding agents 
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are second antibody molecules, complement proteins or \ 
fragments thereof, S. aureus protein A, and the like, s 
Preferably the specific binding agent binds the 
reagent species when that species is present as part 
5 of a complex. <V 
In preferred embodiments, the specific binding 
agent is labeled. However, when the diagnostic system 
includes a specific binding agent that is not labeled, 
the agent is typically used as an amplifying means or 

10 reagent. In these embodiments, the labeled specific 
binding agent is capable of specifically binding the 
amplifying means when the amplifying means is bound to 
a reagent species-containing complex. 

The diagnostic kits of the present invention carif'.V 

15 be used in an "ELISA" format to detect the quantity of 
an APC inhibitor of this invention in a vascular fluid 
sample such as blood, serum, or plasma. "ELISA" 
refers to an enzyme-linked immunosorbent assay that 
employs an antibody or antigen bound to a solid phase 

20 and an enzyme-antigen or enzyme-antibody conjugate to 
detect and quantify the amount of an antigen present , 
in a sample. A description of the ELISA technique is 
found in Chapter 22 of the 4th Edition of Basic and 
Clinical Immunology bv D.P. Sites et al., published by 

25 Lange Medical Publications of Los Altos, CA in 1982 

and in U.S. Patents No. 3,654,090; No. 3,850,752; and ^ 
No. 4,016,043, which are all incorporated herein by 
reference. 

Thus, in some embodiments, a human monoclonal 
30 antibody of the present invention can be affixed to a 

solid matrix to form a solid support that comprises a ; 
package in the subject diagnostic systems. 

A reagent is typically affixed to a solid matrix \ 
by adsorption from an aqueous medium although other ;V 
35 modes of affixation applicable to proteins and 
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polypeptides well known to those skilled in the art, 
can be used. 

Useful solid matrices are also well known in the 
art. Such materials are water insoluble and include 
5 the cross-linked dextran available under the trademark 
SEPHADEX from Pharmacia Fine Chemicals (Piscataway, 
NJ) ; agarose; beads of polystyrene beads about 1 
micron to about 5 millimeters in diameter available 
from Abbott Laboratories of North Chicago, IL; 

10 polyvinyl chloride, polystyrene, cross-linked 

polyacrylamide, nitrocellulose- or nylon-based webs 
such as sheets, strips or paddles; or tubes, plates or 
the wells of a microtiter plate such as those made 
from polystyrene or polyvinylchloride. 

15 The reagent species, labeled specific binding 

agent or amplifying reagent of any diagnostic system 
described herein can be provided in solution, as a 
liquid dispersion or as a substantially dry power, 
e.g., in lyophilized form. Where the indicating means 

20 is an enzyme, the enzyme's substrate can also be 

provided in a separate package of a system. A solid 
support such as the before-described microtiter plate 
and one or more buffers can also be included as 
separately packaged elements in this diagnostic assay 

25 system. 

The packaging materials discussed herein in 
relation to diagnostic systems are those customarily 
utilized in diagnostic systems • 

The term "package" refers to a solid matrix or 

30 material such as glass, plastic (e.g., polyethylene, 
polypropylene and polycarbonate) , paper, foil and the 
like capable of holding within fixed limits a 
diagnostic reagent such as a monoclonal antibody of 
the present invention- Thus, for example, a package 

35 can be a bottle, vial, plastic and plastic-foil 
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laminated envelope or the like container used to 
contain a contemplated diagnostic reagent or it can be 
a microtiter plate well to which microgram quantities 
of a contemplated diagnostic reagent have been 
5 operatively affixed, i.e., linked so as to be capable 
of being immunologically bound by an antibody or 
polypeptide to be detected. 

The materials for use in the assay of the 
invention are ideally suited for the preparation of a 

10 kit. Such a kit may comprise a carrier means being 

compartmentalized to receive in close confinement one 

or more container means such as vials, tubes, and the 

like, each of the container means comprising one of 

the separate elements to be used in the method. For v J }\ ] 

15 example, one of the container means may comprise a 
human monoclonal antibody of the invention which is, 
or can be, detectably labelled. The kit may also have 
containers containing any of the other above-recited 

immunochemical reagents used to practice the .fy/r 
20 diagnostic methods. 

F. Methods for Producing an HT V-Neutralizincr 
Human Monoclonal Antibody 

The present invention describes methods for 
producing novel HIV-neutralizing human monoclonal 
antibodies. The methods are based generally on the 
use of combinatorial libraries of antibody molecules 
which can be produced from a variety of sources, and 
include naive libraries, modified libraries, and 
libraries produced directly from human donors 
exhibiting an HIV-specific immune response. 

The combinatorial library production and 
manipulation methods have been extensively described 
in the literature, and will not be reviewed in detail 
herein, except for those feature required to make and 
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use unique embodiments of the present invention. 
However, the methods generally involve the use of a 
filamentous phage (phagemid) surface expression vector 
system for cloning and expressing antibody species of 
5 the library. Various phagemid cloning systems to 

produce combinatorial libraries have been described by 
others. See, for example the preparation of 
combinatorial antibody libraries on phagemids as 
described by Kang et al., Proc. Natl. Acad. Sci.. USA , 

10 88:4363-4366 (1991); Barbas et al., Proc. Natl. Acad. 

Sci.. USA . 88:7978-7982 (1991); Zebedee et al., Proc. 
Natl. Acad. Sci., USA . 89:3175-3179 (1992); Kang et 
al., Proc. Natl. Acad. Sci., USA , 88:11120-11123 
(1991); Barbas et al., Proc. Natl. Acad. Sci.. USA . 

15 89:4457-4461 (1992); and Gram et al., Proc. Natl. 

Acad. Sci. . USA . 89:3576-3580 (1992), which references 
are hereby incorporated by reference. 

In one embodiment, the method involves preparing 
a phagemid library of human monoclonal antibodies by 

20 using donor immune cell messenger RNA from HIV- 
infected donors* The donors can be symptomatic of 
AIDS, but in preferred embodiments the donor is 
asymptomatic, as the resulting library contains a 
substantially higher number of HIV-neutralizing human 

25 monoclonal antibodies. 

In another embodiment, the donor is naive 
relative to an immune response to HIV, i.e., the donor 
is not HIV-infected. Alternatively, the library can 
be synthetic, or can be derived from a donor who has 

30 an immune response to other antigens. 

The method for producing a human monoclonal 
antibody generally involves (1) preparing separate H 
and L chain-encoding gene libraries in cloning vectors 
using human immunoglobulin genes as a source for the 

35 libraries, (2) combining the H and L chain encoding 
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gene libraries into a single dicistronic expression 
vector capable of expressing and assembling a 
heterodimeric antibody molecule, (3) expressing the 
assembled heterodimeric antibody molecule on the 
surface of a filamentous phage particle, (4) isolating 
the surface-expressed phage particle using 
immunoaff inity techniques such as panning of phage 
particles against a preselected antigen, thereby 
isolating one or more species of phagemid containing 
particular R and L chain-encoding genes and antibody 
molecules that immunoreact with the preselected 
antigen . 

As described herein the Examples, the resulting 
phagemid library can be manipulated to increase and/or 
alter the immunospecif icities of the monoclonal 
antibodies of the library to produce and subsequently 
identify additional, desirable, human monoclonal 
antibodies of the present invention. 

For example, the heavy (H) chain and light (L) 
chain immunoglobulin molecule encoding genes can be 
randomly mixed (shuffled) to create new HL pairs in an 
assembled immunoglobulin molecule. Additionally, 
either or both the H and L chain encoding genes can be 
mutagenized in the complementarity determining region 
(CDR) of the variable region of the immunoglobulin 
polypeptide, and subsequently screened for desirable 
immunoreaction and neutralization capabilities. 

In one embodiment, the H and L genes can be 
cloned into separate, monocistronic expression 
vectors, referred to as a "binary" system described 
further herein. In this method, step (2) above 
differs in that the combining of H and L chain 
encoding genes occurs by the co-intrdduction of the 
two binary plasmids into a single host cell for 
expression and assembly of a phagemid having the 



surface accessible antibody heterodimer molecule. 

In one shuffling embodiment, the shuffling can be 
accomplished with the binary expression vectors, each 
capable of expressing a single heavy or light chain 
encoding gene, as described in Example 6. 

In the present methods, the antibody molecules 
are monoclonal because the cloning methods allow for 
the preparation of clonally pure species of antibody 
producing cell lines. In addition, the monoclonal > 
antibodies are human because the H and L chain 
encoding genes are derived from human immunoglobulin 
producing immune cells, such as spleen, thymus, bone 
marrow, and the like. 

The method of producing a HIV-neutralizing human 
monoclonal antibody also requires that the resulting 
antibody library, immunoreactive with a preselected 
HIV antigen, is screened for the presence of antibody 
species which have the capacity to neutralize HIV in 
one or more of the assays described herein for 
determining neutralization capacity. Thus, a 
preferred library of antibody molecules is first 
produced which binds to an HIV antigen, preferably 
gpl60, gpl20, gp41, the V3 loop region of gp!60, or 
the CD4 binding site of gpl20 and gp41, and then is 
screened for the presence of HIV-neutralizing 
antibodies as described herein. 

Additional libraries can be screened from 
shuffled libraries for additional HIV-immunoreactive 
and neutralizing human monoclonal antibodies. 

As a further characterization of the present 
invention the nucleotide and corresponding amino acid 
residue sequence of the antibody molecule's H or L . 
chain encoding gene is determined by nucleic acid 
sequencing. The primary amino acid residue sequence 
information provides essential information regarding 
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the antibody molecule^ epitope reactivity. 

Sequence comparisons of identified HIV* 
immunoreactive monoclonal antibody variable chain 
region sequences are shown herein in Figures 10-13. 
5 The sequences are aligned based on sequence homology, 
and groups of related antibody molecules are 
identified thereby in which heavy chain or light chain 
genes share substantial sequence homology. 

An exemplary preparation of a human monoclonal 

10 antibody is described in the Examples. The isolation 
of a particular vector capable of expressing an 
antibody of interest involves the introduction of the 
dicistronic expression vector into a host cell 
permissive for expression of filamentous phage genes 

15 and the assembly of phage particles. Where the binary 
vector system is used, both vectors are introduced in 
the host cell. Typically, the host is E. coli . 
Thereafter, a helper phage genome is introduced into 
the host cell containing the immunoglobulin expression 

20 vector (s) to provide the genetic complementation 

necessary to allow phage particles to be assembled. 
The resulting host cell is cultured to allow the 
introduced phage genes and immunoglobulin genes to be 
. expressed, and for phage particles to be assembled and 

25 shed from the host cell. The shed phage particles are 
then harvested (collected) from the host cell culture 
media and screened for desirable immunoreaction and 
neutralization properties. Typically, the harvested 
particles are "panned" for immunoreaction with a 

30 preselected antigen. The strongly immunoreactive 

particles are then collected, and individual species 
of particles are clonally isolated and further 
screened for HIV neutralization. Phage which produce 
neutralizing antibodies are selected and used as a 

35 source of a human HIV neutralizing monoclonal antibody 



WO 94/07922 



PCI7US93/09328 



of this invention. 

Human monoclonal antibodies of this invention can 
also be produced by altering the nucleotide sequence 
of a polynucleotide sequence that encodes a heavy or 
5 light chain of a monoclonal antibody of this 

invention. For example, by site directed mutagenesis, 
one can alter the nucleotide sequence of an expression 
vector and thereby introduce changes in the resulting 
expressed amino acid residue sequence. Thus one can 

10 take the polynucleotide of SEQ ID NO 66, for example, 
and convert it into the polynucleotide of SEQ ID NO 
67. Similarly, one can take a known polynucleotide 
and randomly alter it by random mutagenesis, 
reintroduce the altered polynucleotide into an 

15 expression system and subsequently screen the product 
H:L pair for HIV-neutralizing activity. 

Site-directed and random mutagenesis methods are 
well known in the polynucleotide arts, and are not to 
be construed as limiting as methods for altering the 

2t) nucleotide sequence of a subject polynucleotide. 

Due to the presence of the phage particle in an 
immunoaf finity isolated antibody, one embodiment 
involves the manipulation of the resulting cloned 
genes to truncate the immunoglobulin-coding gene such 

25 that a soluble Fab fragment is secreted by the host E. 
coli cell containing the phagemid vector. Thus, the 
resulting manipulated cloned immunoglobulin genes 
produce a soluble Fab which can be readily 
characterized in ELISA assays for epitope binding 

30 studies, in competition assays with known anti-HIV 

antibody molecules, and in HIV neutralization assays. 
The solubilized Fab provides a reproducible and 
comparable antibody preparation for comparative and 
characterization studies. 

35 The preparation of soluble Fab is generally 
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described in the immunological arts, and can be 
conducted as described herein in Example 2b6) , or as 
described by Burton et al., Proc. Natl. Acad. Sci. . 
USA , 88:10134-10137 (1991). 

G. Expression Vectors and Polynucleotides for 
Expressing Ant i -HIV Monoclonal Antibodies 
The preparation of human monoclonal 
antibodies of this invention depends, in one 
embodiment, on the cloning and expression vectors used 
to prepare the combinatorial antibody libraries 
described herein. The cloned immunoglobulin heavy and 
light chain genes can be shuttled between lambda 
vectors, phagemid vectors and plasmid vectors at 
various stages of the methods described herein. 

The phagemid vectors produce fusion proteins that 
are expressed on the surface of an assembled 
filamentous phage particle. 

A preferred phagemid vector of the present 
invention is a recombinant DNA (rDNA) molecule 
containing a nucleotide sequence that codes for and is 
capable of expressing a fusion polypeptide containing, 
in the direction of amino- to carboxy-terminus, (1) a 
prokaryotic secretion signal domain, (2) a 
heterologous polypeptide defining an immunoglobulin 
heavy or light chain variable region, and (3) a 
filamentous phage membrane anchor domain. The vector 
includes DNA expression control sequences for 
expressing the fusion polypeptide, preferably 
prokaryotic control sequences. 

The filamentous phage membrane anchor is 
preferably a domain of the cpIII or cpVIII coat 
protein capable of associating with the matrix of a 
filamentous phage particle, thereby incorporating the 
fusion polypeptide onto the phage surface. 
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The secretion signal is a leader peptide domain 
of a protein that targets the protein to the 
periplasmic membrane of gram negative bacteria. A 
preferred secretion signal is a pelB secretion signal. 
The predicted amino acid residue sequences of the 
secretion signal domain from two pelB gene product 
variants from Erwinia carotova are described in Lei et 
al. f Nature . 331:543-546 (1988). 

The leader sequence of the pelB protein has 
previously been used as a secretion signal for fusion 
proteins (Better et al., Science , 240:1041-1043 
(1988); Sastry et al., Proc. Natl. Acad. Sci.. USA . 
86:5728-5732 (1989); and Mullinax et al., Proc. Natl. 
Acad. Sci. . USA . 87:8095-8099 (1990)). Amino acid 
residue sequences for other secretion signal 
polypeptide domains from E. coli useful in this 
invention as described in Oliver, Escherichia coli 
and Salmonella Tvphimurium / Neidhard. F.C. (ed.), 
American Society for Microbiology, Washington, D.C., 
1:56-69 (1987) . 

Preferred membrane anchors for the vector are 
obtainable from filamentous phage M13, fl, fd, and 
equivalent filamentous phage. Preferred membrane 
anchor domains are found in the coat proteins encoded 
by gene III and gene VIII. The membrane anchor domain 
of a filamentous phage coat protein is a portion of 
the carboxy terminal region of the coat protein and 
includes a region of hydrophobic amino acid residues 
for spanning a lipid bilayer membrane, and a region of 
charged amino acid residues normally found at the 
cytoplasmic face of the membrane and extending away 
from the membrane. 

In the phage fl, gene VIII coat protein's 
membrane spanning region comprises residue Trp-26 
through Lys-40, and the cytoplasmic region comprises 
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the carboxy-terminal 11 residues from 41 to 52 (Ohkawa 
et al., J. Biol. Chem. . 256:9951-9958 ^(1981) ) . An ^ 
exemplary membrane anchor would consist of residues 26 ; 
to 40 of cpVIII. Thus, the amino acid residue 
5 sequence of a preferred membrane anchor domain is 7 . ' 
derived from the M13 filamentous phage gene VIII coat 
protein (also designated cpVIII or CP 8) . Gene VIII 
coat protein is present on a mature filamentous phage 
over the majority of the phage particle with typically 

10 about 2500 to 3000 copies of the coat protein. 

In addition, the amino acid residue sequence of 
another preferred membrane anchor domain is derived 
from the M13 filamentous phage gene III coat protein 
(also designated cpIII) . Gene III coat protein is 

15 present on a mature filamentous phage at one end of 

the phage particle with typically about 4 to 6 copies 
of the coat protein. > 

For detailed descriptions of the structure of 
filamentous phage particles; their coat proteins and 

20 particle assembly, see the reviews by Rached et al., 
Microbiol. Rev. . 50:401-427 (1986) ; and Model et al., 
in "The Bacteriophages: Vol. 2", R. Calendar, ed. 
Plenum Publishing Co., pp. 375-456 (1988). 

DNA expression control sequences comprise a set 

25 of DNA expression signals for expressing a structural 
gene product and include both 5* and 3 V elements, as 
is well known, operatively linked to the cistron such 
that the cistron is able to express a structural gene 
product. The 5 f control sequences define a promoter 

30 for initiating transcription and a ribosome binding 
site operatively linked at the 5* terminus of the 
upstream translatable DNA sequence. 

To achieve high levels of gene expression in E. 
coli . it is necessary to use not only strong promoters V" 

35 to generate large quantities of mRNA, but also 
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ribosome binding sites to ensure that the mRNA is 
efficiently translated. In E. coli . the ribosome 
binding site includes an initiation codon (AUG) and a 
sequence 3-9 nucleotides long located 3-11 nucleotides 
upstream from the initiation codon (Shine et al., 
Nature . 254:34 (1975). The sequence, AGGAGGU, which 
is called the Shine-Da lgarno (SD) sequence, is 
complementary to the 3 1 end of E. coli 16S rRNA. 
Binding of the ribosome to mRNA and the sequence at 
the 3 1 end of the mRNA can be affected by several 
factors: 

(i) The degree of complementarity between 
the SD sequence and 3' end of the 16S rRNA. 

(ii) The spacing and possibly the DNA 
sequence lying between the SD sequence and the AUG. 
Roberts et al., Proc. Natl. Acad. Sci.. USA . 76:760, 
(1979a); Roberts et al., Proc. Natl. Acad. Sci. USA . 
76:5596 (1979b); Guarente et al. . Science . 209:1428 
(1980); and Guarente et al.. Cell . 20:543 (1980). 
Optimization is achieved by measuring the level of 
expression of genes in plasmids in which this spacing 
is systematically altered. Comparison of different 
mRNAs shows that there are statistically preferred 
sequences from positions -20 to +13 (where the A of 
the AUG is position 0) . Gold et al. , Annu. Rev. 
Microbiol. . 35:365 (1981). Leader sequences have been 
shown to influence translation dramatically. Roberts 
et al., 1979 a, b supra . 

(iii) The nucleotide sequence following the 
AUG, which affects ribosome binding. Taniguchi et 
al., J. Mol. Biol. . 118:533 (1978). 

The 3 • control sequences define at least one 
termination (stop) codon in frame with and operatively 
linked to the heterologous fusion polypeptide. 

In preferred embodiments, the vector utilized 
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includes a prokaryotic origin of replication or 
replicon, i.e., a DNA sequence having the ability to 
direct autonomous replication and maintenance of the 
recombinant DNA molecule extra chromosomally in a 
5 prokaryotic host cell, such as a bacterial host cell, 
transformed therewith. . Such origins of replication 
are well known in the art. Preferred origins of 
replication are those that are efficient in the host 
organism. A preferred host cell is E. coli . For use 

10 of a vector in E. coli . a preferred origin of 

replication is ColEl found in pBR322 and a variety of 
other common plasmids. Also preferred is the pl5A 
origin of replication found on pACYC and its 
derivatives. The ColEl and p!5A replicon have been 

15 extensively utilized in molecular biology, are 

available on a variety of plasmids and are described 
at least by Sambrook et al., in "Molecular Cloning: a 
Laboratory Manual", 2nd edition, Cold Spring Harbor 
Laboratory Press (1989) . 

20 The ColEl and pl5A replicons are particularly 

preferred for use in one embodiment of the present 
invention where two "binary" plasmids are utilized 
because they each have the ability to direct the 
replication of plasmid in E. coli while the other 

25 replicon is present in a second plasmid in the same E. 
coli cell. In other words, ColEl and p!5A are non- 
interfering replicons that allow the maintenance of 
two plasmids in the same host (see, for example, 
Sambrook et al., supra, at pages 1.3-1.4). This 

30 feature is particularly important in the binary 

vectors embodiment of the present invention because a 
single host cell permissive for phage replication must 
support the independent and simultaneous replication 
of two separate vectors, namely a first vector for 

35 expressing a heavy chain polypeptide, and a second 
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vector for expressing a light chain polypeptide. 

In addition, those embodiments that include a 
prokaryotic replicon can also include a gene whose 
expression confers a selective advantage, such as drug 
resistance, to a bacterial host transformed therewith. 
Typical bacterial drug resistance genes are those that 
confer resistance to ampicillin, tetracycline, 
neomycin/kanamycin or cholamphenicol. Vectors 
typically also contain convenient restriction sites 
for insertion of translatable DNA sequences. 
Exemplary vectors are the plasmids pUC8, pUC9, pBR322, 
and pBR329 available from BioRad Laboratories, 
(Richmond, CA) and pPL and pKK223 available from 
Pharmacia, (Piscataway, NJ) • 

A vector for expression of a monoclonal antibody 
of the invention on the surface of a filamentous phage 
particle is a recombinant DNA (rDNA) molecule adapted 
for receiving and expressing translatable first and 
second DNA sequences in the form of first and second 
polypeptides wherein one of the polypeptides is fused 
to a filamentous pjiage coat protein membrane anchor. 
That is, at least one of the polypeptides is a fusion 
polypeptide containing a filamentous phage membrane 
anchor domain, a prokaryotic secretion signal domain, 
and an immunoglobulin heavy or light chain variable 
domain. 

A DNA expression vector for expressing a 
heterodimeric antibody molecule provides a system for 
independently cloning (inserting) the two translatable 
DNA sequences into two separate cassettes present in 
the vector, to form two separate cistrons for 
expressing the first and second polypeptides of the 
antibody molecule, or the ligand binding portions of 
the polypeptides that comprise the antibody molecule 
(i.e., the H and L variable regions of an 
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immunoglobulin molecule) . The DNA expression vector 
for expressing two cistrons is referred to as a 
dicistronic expression vector. 

The vector comprises a first cassette that 
includes upstream and downstream translatable DNA 
sequences operatively linked via a sequence of 
nucleotides adapted for directional ligation to an 
insert DNA. The upstream translatable sequence 
encodes the secretion signal as defined herein. The 
downstream translatable sequence encodes the 
filamentous phage membrane anchor as defined herein. 
The cassette preferably includes DNA expression 
control sequences for expressing the receptor 
polypeptide that is produced when an insert 
translatable DNA sequence (insert DNA) is 
directionally inserted into the cassette via the 
sequence of nucleotides adapted for directional 
ligation. The filamentous phage membrane anchor is 
preferably a domain of the cpIII or cpVIII coat 
protein capable of binding the matrix of a filamentous 
phage particle, thereby incorporating the fusion 
polypeptide onto the phage surface. 

The receptor expressing vector also contains a 
second cassette for expressing a second receptor 
polypeptide. The second cassette includes a second 
translatable DNA sequence that encodes a secretion 
signal, as defined herein, operatively linked at its 
3 1 terminus via a sequence of nucleotides adapted for 
directional ligation to a downstream DNA sequence of 
the vector that typically defines at least one stop 
codon in the reading frame of the cassette. The 
second translatable DNA sequence is operatively linked 
at its 5 f terminus to DNA expression control sequences 
forming the 5 f elements. The second cassette is 
capable, upon insertion of a translatable DNA sequence 
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(insert DNA) , of expressing the second fusion 
polypeptide comprising a receptor of the secretion -V 
signal with a polypeptide coded by the insert DNA. 
An upstream translatable DNA sequence encodes a 
5 prokaryotic secretion signal as described earlier* ^( 
The upstream translatable DNA sequence encoding the "/v 
pelB secretion signal is a preferred DNA sequence for 
inclusion in a receptor expression vector. A 
downstream translatable DNA sequence encodes a 

10 filamentous phage membrane anchor as described 

earlier. Thus, a downstream translatable DNA sequence 
encodes an amino acid residue sequence, that 
corresponds, and preferably is identical, to the 
membrane anchor domain of either a filamentous phage 

15 gene III or gene VIII coat polypeptide. '<}. 
A cassette in a DNA expression vector of this 
invention is the region of the vector that forms, upon 
insertion of a translatable DNA sequence (insert DNA), 
a sequence of nucleotides capable of expressing, in an 

20 appropriate host, a fusion polypeptide. The 

expression-competent sequence of nucleotides is 
referred to as a cistron. Thus, the cassette 
comprises DNA expression control elements operative ly 
linked to the upstream and downstream translatable DNA 

25 sequences. A cistron is formed when a translatable 

DNA sequence is directionally inserted ( direct ionally > ; 
ligated) between the upstream and downstream sequences 
via the sequence of nucleotides adapted for that 
purpose. The resulting three translatable DNA 

30 sequences, namely the upstream, the inserted and the -o 
downstream sequences, are all operatively linked in 
the same reading frame. 

Thus, a DNA expression vector for expressing an </ 
antibody molecule provides a system for cloning 

35 translatable DNA sequences into the cassette portions 
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of the vector to produce cistrons capable of 
expressing the first and second polypeptides, i.e., 
the heavy and light chains of a monoclonal antibody. 

As used herein, the term " vector" refers to a 
nucleic acid molecule capable of transporting between 
different genetic environments another nucleic acid to 
which it has been operatively linked. Preferred 
vectors sure those capable of autonomous replication 
and expression of structural gene products present in 
the DNA segments to which they are operatively linked. 
Vectors, therefore, preferably contain the replicons 
and selectable markers described earlier. 

As used herein with regard to DNA sequences or 
segments, the phrase "operatively linked" means the 
sequences or segments have been covalently joined, 
preferably by conventional phosphodiester bonds, into 
one strand of DNA, whether in single or double 
stranded form. The choice of vector to which 
transcription unit or a cassette of this invention is 
operatively linked depends directly, as is well known 
in the art, on the functional properties desired, 
e.g. , vector replication and protein expression, and 
the host cell to be transformed, these being 
limitations inherent in the art of constructing 
recombinant DNA molecules. 

A sequence of nucleotides adapted for directional 
ligation, i.e., a poly linker, is a region of the DNA 
expression vector that (1) operatively links for 
replication and transport the upstream and downstream 
translatable DNA sequences and (2) provides a site or 
means for directional ligation of a DNA sequence into 
the vector. Typically, a directional poly linker is a 
sequence of nucleotides that defines two or more 
restriction endonuclease recognition sequences, or 
restriction sites. Upon restriction cleavage, the two 
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sites yield cohesive termini to which a translatable 
DNA sequence can be ligated to the DNA expression 
vector. Preferably, the two restriction sites 
provide, upon restriction cleavage, cohesive termini 
5 that are non-complementary and thereby permit 

directional insertion of a translatable DNA sequence 
into the cassette. In one embodiment, the directional 
ligation means is provided by nucleotides present in 
the upstream translatable DNA sequence, downstream 

10 translatable DNA sequence, or both. In another 

embodiment, the sequence of nucleotides adapted for 
directional ligation comprises a sequence of 
nucleotides that defines multiple directional cloning 
means. Where the sequence of nucleotides adapted for 

15 directional ligation defines numerous restriction 

sites, it is referred to as a multiple cloning site. 

In a preferred embodiment, a DNA expression 
vector is designed for convenient manipulation in the 
form of a filamentous phage particle encapsulating a 

20 genome according to the teachings of the present 
invention. In this embodiment, a DNA expression 
vector further contains a nucleotide sequence that 
defines a filamentous phage origin of replication such 
that the vector, upon presentation of the appropriate 

25 genetic complementation, can replicate as a 

filamentous phage in single stranded replicative form 
and be packaged into filamentous phage particles. 
This feature provides the ability of the DNA 
expression vector to be packaged into phage particles 

30 for subsequent segregation of the particle, and vector 
contained therein, away from other particles that 
comprise a population of phage particles. 

A filamentous phage origin of replication is a 
region of the phage genome, as is well known, that 

35 defines sites for initiation of replication, 
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termination of replication and packaging of the 
replicative form produced by replication (see, for 
example, Rasched et al., Microbiol. Rev. . 50:401-427 
(1986); and Horiuchi, J. Mol. Biol. . 188:215-223 
5 (1986)). 

A preferred filamentous phage origin of 
replication for use in the present invention is an 
M13, fl or fd phage origin of replication (Short et 
al., Nucl. Acids Res. . 16:7583-7600 (1988)). 

10 Preferred DNA expression vectors for cloning and 
expression a human monoclonal antibody of this 
invention are the dicistronic expression vectors 
pComb8, pComb2-8, pComb3, pComb2-3 and pComb2-3', 
described herein. 

15 A particularly preferred vector of the present 

invention includes a polynucleotide sequence that 
encodes a heavy or light chain variable region of a 
human monoclonal antibody of the present invention. 
Particularly preferred are vectors that include a 

20 nucleotide sequence that encodes a heavy or light 

chain amino acid residue sequence shown in Figures 10- 
13, that encodes a heavy or light chain having the 
binding specificity of those sequences shown in 
Figures 10-13, or that encodes a heavy or light chain 

25 having conservative substitutions relative to a 

sequence shown in Figures 10-13, and complementary 
polynucleotide sequences thereto. 

Insofar as polynucleotides are component parts of 
a DNA expression vector for producing a human 

30 monoclonal antibody heavy or light chain 

immunoglobulin variable region amino acid residue 
sequence, the invention also contemplates isolated 
polynucleotides that encode such heavy or light chain 
sequences • 

35 It is to be understood that, due to the genetic 
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code and its attendant redundancies, numerous 
polynucleotide sequences can be designed that encode a 
contemplated heavy or light chain immunoglobulin 
variable region amino acid residue sequence. Thus, 
5 the invention contemplates such alternate 

polynucleotide sequences incorporating the features of 
the redundancy of the genetic code. 

Insofar as the expression vector for producing a 
human monoclonal antibody of this invention is carried 

10 in a host cell compatible with expression of the 
antibody, the invention contemplates a host cell 
containing a vector or polynucleotide of this 
invention* A preferred host cell is E. coli r as 
described herein. 

15 E. coli cultures containing preferred expression 

vectors that produce a human monoclonal antibody of 
this invention were deposited pursuant to Budapest 
Treaty requirements with the American Type Culture 
Collection (ATCC) , Rockville, MD, as described herein. 

20 

Examples 

The following examples are intended to 
illustrate, but not limit, the scope of the invention. 

25 1. Construction of a Dicistronic Expression Vector 

for Producing a Heterodimeric Receptor on Phage 
Particles 

To obtain a vector system for generating a large 
number of Fab antibody fragments that can be screened 

30 directly, expression libraries in bacteriophage Lambda 
have previously been constructed as described in Huse 
et al., Science . 246:1275-1281 (1989). These systems 
did not contain design features that provide for the 
expressed Fab to be targeted to the surface of a 

35 filamentous phage particle. 



WO 94/07922 



PCT/US93/09328 



59 

The main criterion used in choosing a vector 
system was the necessity of generating the largest 
number of Fab fragments which could be screened 
directly. Bacteriophage Lambda was selected as the 
5 starting point to develop an expression vector for 

three reasons. First, jji vitro packaging of phage DNA 
was the most efficient method of reintroducing DNA 
into host cells. Second, it was possible to detect 
protein expression at the level of single phage 

10 plagues. Finally, the screening of phage libraries 
typically involved less difficulty with nonspecific 
binding. The alternative, plasmid cloning vectors, 
Eire only advantageous in the analysis of clones after 
they have been identified. This advantage was not 

15 lost in the present system because of the use of a 
dicistronic expression vector such as pCombVIII, 
thereby permitting a plasmid containing the heavy 
chain, light chain, or Fab expressing inserts to be 
excised. 

20 a. Construction of Dicistronic Expression 

Vector pCOMB 

1) Preparation of Lambda Zap TH II 

Lambda Zap™ II is a derivative of the 
original Lambda Zap (ATCC Accession No. 40,298) that 

25 maintains all of the characteristics of the original 
Lambda Zap including 6 unique cloning sites, fusion 
protein expression, and the ability to rapidly excise 
the insert in the form of a phagemid (Bluescript SK-) , 
but lacks the SAM 100 mutation, allowing growth on 

30 many Non-Sup F strains, including XLl-Blue. The 

Lambda Zap™ II was constructed as described in Short 
et al., Nuc. Acids Res. , 16:7583-7600, 1988, by 
replacing the Lambda S gene contained in a 4254 base 
pair (bp) DNA fragment produced by digesting Lambda 

35 Zap with the restriction enzyme Nco I. This 4254 bp 
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DNA fragment was replaced with the 4254 bp DNA 
fragment containing the Lambda S gene isolated from 
Lambda gtlO (ATCC # 40,179) after digesting the vector 
with the restriction enzyme Nco I. The 4254 bp DNA 
5 ■ fragment isolated from lambda gtlO was ligated into 

the original Lambda Zap vector using T4 DNA ligase and 
standard protocols such as those described in Current 
Protocols in Molecular Biology . Ausubel et al., eds., 
John Wiley and Sons, NY, 1987, to form Lambda Zap™ 
10 II. 

2) Preparation of Lambda Hc2 

To express a plurality of V H -coding DNA 
homologs in an E. coli host cell, a vector designated 
Lambda Hc2 was constructed. The vector provided the 

15 following: the capacity to place the V H -coding DNA 
homologs in the proper reading frame; a ribosome 
binding site as described by Shine et al., Nature . 
254:34 (1975); a leader sequence directing the 
expressed protein to the periplasmic space designated 

20 the pelB secretion signal; a polynucleotide sequence 

that coded for a known epitope (epitope tag); and also 
a polynucleotide that coded for a spacer protein 
between the V H -coding DNA homo log and the 
polynucleotide coding for the epitope tag. Lambda Hc2 

25 has been previously described by Huse et al., Science, 
246:1275-1281 (1989). 

To prepare Lambda Hc2, a synthetic DNA sequence 
containing all of the above features was constructed 
by designing single stranded polynucleotide segments 

30 of 20-40 bases that would hybridize to each other and 
form the double stranded synthetic DNA sequence shown 
in Figure 1. The individual single-stranded 
polynucleotide segments are shown in Table 1. 

Polynucleotides N2, N3, N9-4, Nil, N10-5, N6, N7 

35 and N8 (Table 1) were kinased by adding 1 Ml of each 
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polynucleotide 0.1 micrograms/microliter (Mg/Ml) and 
20 units of T A polynucleotide kinase to a solution 
containing 70 nM Tris-HCl (Tris[hydroxymethyl] 
aminomethane hydrochloride) at pH 7.6, 10 mM MgCl 2 , 5 
5 mM dithiothreitol (DTT) , 10 mM beta-mercaptoethanol, 
500 micrograms per milliliter (pg/ml) bovine serum 
albumin (BSA) . The solution was maintained at 37 
degrees Centigrade (37°C) for 30 minutes and the 
reaction stopped by maintaining the solution at 65 °C 

10 for 10 minutes. The two end polynucleotides, 20 
nanograms (ng) of polynucleotides Nl and 
polynucleotides N12, were added to the above kinasing 
reaction solution together with 1/10 volume of a 
solution containing 20 mM Tris-HCl at pH 7.4, 2.0 mM 

15 MgCl 2 and 50 mM NaCl. This solution was heated to 
70 °C for 5 minutes and allowed to cool to room 
temperature, approximately 25°C, over 1.5 hours in a 
500 ml beaker of water. During this time period all 
10 polynucleotides annealed to form the double 

20: stranded synthetic DNA insert shown in Figure 1. The 
individual polynucleotides were covalently linked to 
each other to stabilize the synthetic DNA insert by 
adding 40 pi of the above reaction to a solution 
containing 50 MM Tris-HCl at pH 7.5, 7 mM MgCl^ 1 mM 

25 DTT, 1 mM adenosine triphosphate (ATP) and 10 units of 
T4 DNA ligase. This solution was maintained at 37 °C 
for 30 minutes and then the T4 DNA ligase was 
inactivated by maintaining the solution at 65 °C for 10 
minutes. The end polynucleotides were kinased by 

30 mixing 52 Ml of the above reaction, 4 pi of a solution 
containing 10 mM ATP and 5 units of T4 polynucleotide 
kinase. This solution was maintained at 37 °C for 30 
minutes and then the T4 polynucleotide kinase was 
inactivated by maintaining the solution at 65 °C for 10 

35 minutes. 



WO 94/07922 



PCTAJS93/09328 



62 
Table 1 



SEQ 








ID NO 








(15) 


Nl) 


5* 


GGCCGCAAATTCTATTTCAAGGAGAGAGTCAT 3' 


(16) 


N2) 


5' 


AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3 * 


(17) 


N3) 


5« 


GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3 • 


(18) 


N6) 


5' 


CAGTTTCACCTGGGCCATGGCTGGTTGGG 3' 


(19) 


N7) 


5« 


CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3 ' 


(20) 


N8) 


5' 


GTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGC 3' 


(21) 


N9-4) 


5' 


AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 3 • 


(22) 


N10-5) 


5' 


CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3 • 


(23) 


Nil) 


5' 


GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3' 


(24) 


N12) 


5» 


TCGACGAATTCTATTAAGAACCGTAGTC 3' 



15 

The completed synthetic DNA insert was ligated 
directly into the Lambda Zap™ II vector described in 
Example lal) that had been previously digested with 
the restriction enzymes. Not I and Xho I. The 

20 ligation mixture was packaged according to the 

manufacture's instructions using Gigapack II Gold 
packing extract available from Stratagene f La Jolla, 
California. The packaged ligation mixture was plated 
on XLl-Blue cells (Stratagene) . Individual lambda 

25 plagues were cored and the inserts excised according 
to the in vivo e xcision protocol for Lambda Zap™ II 
provided by the manufacturer (Stratagene) . This in 
vivo excision protocol moved the cloned insert from 
the Lambda Hc2 vector into a phagemid vector to allow 

30 easy for manipulation and sequencing. The accuracy of 
the above cloning steps was confirmed by sequencing 
the insert using the Sanger dideoxy method described 
in by Sanger et al., Proc. Natl. Acad. Sci. . USA r 
74:5463-5467 (1977) and using the manufacture's 

35 instructions in the AMV Reverse Transcriptase 35 S-ATP 
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sequencing kit (Stratagene) . The sequence of the 
resulting double-stranded synthetic DNA insert in the 
V H expression vector (Lambda Hc2) is shown in Figure 
1. The sequence of each strand (top and bottom) of 
Lambda Hc2 is listed in the Sequence Listing as SEQ ID 
NO 1 and SEQ ID NO 2, respectively. The resultant 
Lambda Hc2 expression vector is shown in Figure 2. 

3) Preparation of Lambda Lc2 

To express a plurality of V L -coding DNA 
homologs in an E. coli host cell, a vector designated 
Lambda Lc2 was constructed having the capacity to 
place the V L -coding DNA homologs in the proper reading 
frame, provided a ribosome binding site as described 
by Shine et al., Nature . 254:34 (1975), provided the 
pelB gene leader sequence secretion signal that has 
been previously used to successfully secrete Fab 
fragments in E. coli by Lei et al., J. Bac. . 169:4379 
(1987) and Better et al., Science . 240:1041 (1988), 
and also provided a polynucleotide containing a 
restriction endonuclease site for cloning. Lambda Lc2 
has been previously described by Huse et al., Science, 
246:1275-1281 (1989). 

A synthetic DNA sequence containing all of the 
above features was constructed by designing single 
stranded polynucleotide segments of 20-60 bases that 
would hybridize to each other and form the double 
stranded synthetic DNA sequence shown in Figure 3. 
The sequence of each individual single-stranded 
polynucleotide segment (01-08) within the double 
stranded synthetic DNA sequence is shown in Table 2. 

Polynucleotides 02, 03, 04, 05, 06 and 07 (Table 
2) were kinased by adding 1 pi (0.1 M9/M1) of each 
polynucleotide and 20 units of T 4 polynucleotide 
kinase to a solution containing 70 mM Tris-HCl at pH 
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7.6, 10 mM MgCl 2 , 5 mM DTT, 10 mM beta- 
mercaptoethanol, 500 fig /ml of BSA. The solution was 
maintained at 37 °C for 30 minutes and the reaction 
stopped by maintaining the solution at 65 °C for 10 
5 minutes. The 20 ng each of the two end polynucleo- 
tides, 01 and 08, were added to the above kinasing 
reaction solution together with 1/10 volume of a 
solution containing 20.0 mM Tris-HCl at pH 7.4, 2.0 mM 
MgCl 2 and 15.0 mM sodium chloride (NaCl) . This 

10 solution was heated to 70 °C for 5 minutes and allowed 
to cool to room temperature, approximately 25 °C, over 
1.5 hours in a 500 ml beaker of water.. During this 
time period all 8 polynucleotides annealed to form the 
double stranded synthetic DNA insert shown in Figure 

15 3. The individual polynucleotides were covalently 
linked to each other to stabilize the synthetic DNA 
insert by adding 40 fil of the above reaction to a 
solution containing 50 mM Tris-HCl at pH 7.5, 7 mM 
MgClj, 1 mM DTT, 1 mM ATP and 10 units of T4 DNA 

20 ligase. This solution was maintained at 37 °C for 30 
minutes and then the T4 DNA ligase was inactivated by 
maintaining the solution at 65 °C for 10 minutes. The 
end polynucleotides were kinased by mixing 52 /il of 
the above reaction, 4 fil of a solution containing 10 

25 mM ATP and 5 units of T4 polynucleotide kinase. This 
solution was maintained at 37 °C for 30 minutes and 
then the T4 polynucleotide kinase was inactivated by 
maintaining the solution at 65 °C for 10 minutes. 

30 Table 2 

SEQ 
XD NO 

(25) 01) 5 1 TGAATTCTAAACTAGTCGCCAAGGAGACAGTCAT 3 1 

(26) 02) 5* AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3' 
35 (27) 03) 5» GTTATTACTCGCTGCCCAACCAGCCATGGCC 3 1 
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(28) 04) 5 • GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 3' 

(29) 05) 5 1 GTATTTCATTATGACTGTCTCCTTGGCGACTAGTTTAGAA- 

TTCAAGCT 3 1 

(30) 06) 5 1 CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3 1 
5 (31) 07) 5 9 TGACGAGCTCGGCCATGGCTGGTTGGG 3 • 

(32) 08) 5» TCGACGGCCGCTTAACTCTAGAAC 3 1 



The completed synthetic DNA insert was ligated 

10 directly into the Lambda Zap™ II vector described in 
Example lal) that had been previously digested with 
the restriction enzymes Sac I and Xho I. The ligation 
mixture was packaged according to the manufacture's 
instructions using Gigapack II Gold packing extract 

15 (Stratagene) . The packaged ligation mixture was 

plated on XLl-Blue cells (Stratagene). Individual 
lambda plagues were cored and the inserts excised 
according to the in vivo excision protocol for Lambda 
Zap™ II provided by the manufacturer (Stratagene). 

20 This in vivo excision protocol moved the cloned insert 
from the Lambda Lc2 vector into a plasmid phagemid 
vector allow for easy manipulation and sequencing. 
The accuracy of the above cloning steps was confirmed 
by sequencing the insert using the manufactured 

25 instructions in the AMV Reverse Transcriptase ^S-dATP 
sequencing kit (Stratagene). The sequence of the 
resulting Lc2 expression vector (Lambda Lc2) is shown 
in Figure 3. Each strand is separately listed in the 
Sequence Listing as SEQ ID NO 3 and SEQ ID NO 4. The 

30 resultant Lc2 vector is schematically diagrammed in 
' - Figure 4. 

A preferred vector for use in this invention, 
- designated Lambda Lc3, is a derivative of Lambda Lc2 

prepared above. Lambda Lc2 contains a Spe I 

35 restriction site located 3' to the EcoR I restriction 
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site and 5' to the Shine-Dalgarno ribosome binding 
site as shown in the sequence in Figure 3 and in SEQ 
ID NO 3. A Spe I restriction site is also present in 
Lambda Hc2 as shown in Figures 1 and 2 and in SEQ ID 
5 NO 1* A combinatorial vector, designated pComb, was 
constructed by combining portions of Lambda Hc2 and 
Lc2 together as described in Example la4) below. The 
resultant combinatorial pComb vector contained two Spe 
I restriction sites, one provided by Lambda Hc2 and 

10 one provided by Lambda Lc2, with an EcoR I site in 
between. Despite the presence of two Spe I 
restriction sites, DNA homologs having Spe I and EcoR 
I cohesive termini were successfully directionally 
ligated into a pComb expression vector previously 

15 digested with Spe I and EcoR I as described in Example 
lb below. The proximity of the EcoR I restriction 
site to the 3' Spe I site, provided by the Lc2 vector, 
inhibited the complete digestion of the 3 V Spe I site. 
Thus, digesting pComb with Spe I and EcoR I did not 

20 result in removal of the EcoR I site between the two 
Spe I sites. 

The presence of a second Spe I restriction site 
may be undesirable for ligations into a pComb vector 
digested only with Spe I as the region between the two 

25 sites would be eliminated. Therefore, a derivative of 
Lambda Lc2 lacking the second or 3 • Spe I site, 
designated Lambda Lc3, was produced by first digesting 
Lambda Lc2 with Spe I to form a linearized vector. 
The ends were filled in to form blunt ends which are 

30 ligated together to result in Lambda Lc3 lacking a Spe 
I site. Lambda Lc3 is a preferred vector for use in 
constructing a combinatorial vector as described 
below. 

35 4) Preparation of pComb 
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Phagemids were excised from the 
expression vectors Lambda Hc2 or Lambda Lc2 using an 
in vivo excision protocol described above. Double 
stranded DNA was prepared from the phagemid-containing 
5 cells according to the methods described by Holmes et 
al., Anal . Biochem . . 114:193 (1981). The phagemids 
resulting from in vivo excision contained the same 
nucleotide sequences for antibody fragment cloning and 
expression as did the parent vectors, and are 

10 designated phagemid Hc2 and Lc2, corresponding to 
Lambda Hc2 and Lc2, respectively* 

For the construction of combinatorial phagemid 
vector pComb, produced by combining portions of 
phagemid Hc2 and phagemid Lc2, phagemid Hc2 was first 

15 digested with Sac I to remove the restriction site 
located 5 1 to the LacZ promoter. The linearized 
phagemid was then blunt ended with T4 polymerase and 
ligated to result in a Hc2 phagemid lacking a Sac I 
site. The modified Hc2 phagemid and the Lc2 phagemid 

20 were then separately restriction digested with Sea I 
and EcoR I to result in a Hc2 fragment having from 5' 
to 3 1 Sea I, Not I, Xho I, Spe I and EcoR I 
restriction sites and a Lc2 fragment having from 5 1 to 
3 V EcoR I, Sac I, Xba I and Sac I restriction sites. 

25 The linearized phagemids were then ligated together at 
their respective cohesive ends to form pComb, a 
circularized phagemid having a linear arrangement of 
restriction sites of Not I, Xho I, Spe I, EcoR I, Sac 
I, Xba I r Not I, Apa I and Sea I. The ligated 

30 phagemid vector was then inserted into an appropriate 
bacterial host and transformants were selected on the 
antibiotic ampicillin. 

Selected ampicillin resistant transformants were 
screened for the presence of two Not I sites. The 

35 resulting ampicillin resistant combinatorial phagemid 
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vector was designated pComb, the schematic 
organization of which is shown in Figure 5. The 
resultant combinatorial vector, pComb, consisted of a 
DNA molecule having two cassettes to express two 
5 fusion proteins and having nucleotide residue 
sequences for the following operatively linked 
elements listed in a 5* to 3 1 direction: a first 
cassette consisting of an inducible LacZ promoter 
upstream from the LacZ gene; a Not I restriction site; 

10 a ribosome binding site; a pelB leader; a spacer; a 
cloning region bordered by a 5 • Xho and 3 1 Spe I 
restriction site; a decapeptide tag followed by 
expression control stop sequences; an EcoR I 
restriction site located 5 V to a second cassette 

15 consisting of an expression control ribosome binding 

site; a pelB leader; a spacer region; a cloning region 
bordered by a 5* Sac I and a 3' Xba I restriction site 
followed by expression control stop sequences and a 
second Not I restriction site. 

20 A preferred combinatorial vector for use in this 

invention, designated pComb2, is constructed by 
combining portions of phagemid Hc2 and phagemid Lc3 as 
described above for preparing pComb, The resultant 
combinatorial vector, pComb2, consists of a DNA 

25 molecule having two cassettes identical to pComb to 
express two fusion proteins identically to pComb 
except that a second Spe I restriction site in the 
second cassette is eliminated. 

30 b. Construction of the pComblll Vector for 

Expressing Fusion Proteins Having a 
Bacteriophage Coat Protein Membrane Anchor 
Because of the multiple endonuc lease 
restriction cloning sites, the pComb phagemid 

35 expression vector prepared above is a useful cloning 
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vehicle for modification for the preparation of an 
expression vector for use in this invention. To that 
end, pComb was digested with EcoR I and Spe I followed 
by phosphatase treatment to produce linearized pComb. 

5 

1) Preparation of pComblll 

A separate phagemid expression vector 
was constructed using sequences encoding bacteriophage 
cpIII membrane anchor domain. A PGR product defining 

10 the DNA sequence encoding the filamentous phage coat 
protein, cpIII, membrane anchor containing a LacZ 
promoter region sequence 3 9 to the membrane anchor for 
expression of the light chain and Spe I and EcoR I 
cohesive termini was prepared from M13mpl8, a 

15 commercially available bacteriophage vector 
(Pharmacia, Piscatavay, New Jersey) • 

To prepare a modified cpIII, replicative form DNA 
from M13mpl8 was first isolated. Briefly, into 2 ml 
of LB (Luria-Bertani medium) , 50 fil of a culture of a 

20 bacterial strain carrying an F f episome (JM107, JM109 
or TGI) was admixed with a one tenth suspension of 
bacteriophage particles derived from a single plaque. 
The admixture was incubated for 4 to 5 hours at 37 °C 
with constant agitation. The admixture was then 

25 centrifuged at 12,000 x g for 5 minutes to pellet the 
infected bacteria. After the supernatant was removed, 
the pellet was resuspended by vigorous vortexing in 
100 Ml of ice-cold solution I. Solution I was 
prepared by admixing 50 mM glucose, 10 mM EDTA 

30 (disodium ethylenediaminetetraacetic acid) and 25 mM 
Tris-HCl at pH 8.0, and autoclaving for 15 minutes. 

To the bacterial suspension, 200 /il of freshly 
prepared Solution II was admixed and the tube was 
rapidly inverted five times. Solution II was prepared 

35 by admixing 0.2 N NaOH and 1% SDS. To the bacterial 
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suspension, 150 /il o f ice-cold Solution III was 
admixed and the tube was vortexed gently in an 
inverted position for 10 seconds to disperse Solution 
III through the viscous bacterial lysate. Solution 
5 III was prepared by admixing 60 ml of 5 M potassium ■ 
acetate, 11.5 ml of glacial acetic acid and 28.5 ml of 
water. The resultant bacterial lysate was then stored 
on ice for 5 minutes followed by centrifugation at 
12,000 x g for 5 minutes at 4°C in a microfuge. The 

10 resultant supernatant was recovered and transferred to 
a new tube. To the supernatant was added an equal 
volume of phenol/ chloroform and the admixture was 
vortexed. The admixture was then centrifuged at 
12,000 x g for 2 minutes in a microfuge* • The 

15 resultant supernatant was transferred to a new tube 
and the double-stranded bacteriophage DNA was 
precipitated with 2 volumes of ethanol at room 
temperature. After allowing the admixture to stand at 
room temperature for 2 minutes, the admixture was 

20 centrifuged to pellet the DNA. The supernatant was 
removed and the pelleted replicative form DNA was 
resuspended in 25 fil of Tris-HCl at pH 7.6, and 10 mM 
EDTA (TE) . 

The double-stranded N13mpl8 replicative form DNA 
25 was then used as a template for isolating the gene \ 

encoding the membrane anchor domain at cpIII, the ; 
sequence of which is listed in the Sequence Listing as 
SEQ ID NO 33. The amino acid residue sequence of 
membrane anchor domain cpIII is listed in SEQ ID NO 
30 34. M13mpl8 replicative form DNA was prepared as 

described above and used as a template for two PCR 
amplifications for construction of a DNA fragment 
consisting of the mature gene for cpIII membrane 
anchor domain located 5 1 to a sequence encoding the 
35 LacZ promoter, operator and cap-binding site for 
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controlling light chain expression . The restriction 
sites, Spe I and EcoR I, were created in the 
amplification reactions and were located at the 5 1 and 
3 1 ends of the fragment respectively. The procedure 
for creating this fragment by combining the products 
of two separate PGR amplifications is described below. 

The primer pair, G-3(F) (SEQ ID NO 35) and G-3(B) 
(SEQ ID NO 36) listed in Table 3, was used in the 
first PCR reaction as performed above to amplify the 
cpIII membrane anchor gene and incorporate Spe I and 
Nhe I restriction sites into the fragment. For the 
PCR reaction, 2 /xl containing 1 ng of M13mpl8 
replicative form DNA was admixed with 10 jxl of 10X PGR 
buffer purchased commercially (Promega Biotech, 
Madison, Wisconsin) in a 0.5 ml microfuge tube. To 
the DNA admixture, 8 /xl of a 2.5 mM solution of dNTPs 
(dATP, dCTP, dGTP, dTTP) was admixed to result in a 
final concentration of 200 micromolar (/ctH) . Three pi 
(equivalent to 60 picomoles (pM) ) of the 6-3 (F) primer 
and 3 /il (60 pM) of the 3 9 backward G-3(B) primer was 
admixed into the DNA solution. To the admixture, 73 
/xl of sterile water and 1 Ml/5 units of polymerase 
(Promega Biotech) was added. Two drops of mineral oil 
were placed on top of the admixture and 40 rounds of 
PCR amplification in a thermocycler were performed. 
The amplification cycle consisted of 52 °C for 2 
minutes, 72 °C for 1.5 minutes and 91°C for 2 minutes. 
The resultant PCR modified cpIII membrane anchor 
domain DNA fragment from M13mpl8 containing samples 
were then purified with Gene Clean (BI0101, La Jolla, 
California) , extracted twice with phenol/chloroform, 
once with chloroform followed by ethanol precipitation 
and were stored at -70°C in 10 mM Tris-HCl at pH 7.5, 
and 1 mM EDTA. 

The resultant PCR modified cpIII DNA fragment 
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having Spe I and Nhe I sites in the 5* and 3* ends, 
respectively, of the fragment was verified by 
electrophoresis in a 1% agarose gel. The area in the 
agarose containing the modified cpIII DNA fragment was 
isolated from the agarose. The sequence of the PCR 
modified cpIII membrane anchor domain DNA fragment is 
listed in the Sequence Listing as SEQ ID NO 40. The 
resultant amplified PCR fragment also contained 
nucleotide sequences for encoding a five amino acid 
tether composed of four glycine residues and one 
serine juxtaposed between the heavy chain and cpIII 
encoding domains. Once expressed, the five amino acid 
residue sequence lacking an orderly secondary 
structure served to minimize the interaction between 
the Fab and cpIII domains. 

A second PCR amplification using the primer 
pairs, Lac-F (SEQ ID NO 37) and Lac-B (SEQ ID NO 38) 
listed in Table 3, was performed on a separate aliquot 
of M13mpl8 replicative form template DNA to amplify 
the LacZ promoter, operator and Capr-binding site 
having a 5 1 Nhe I site and a 3 1 EcoR I site. The 
primers used for this amplification were designed to 
incorporate a Nhe I site on the 5* end of the 
amplified fragment to overlap with a portion of the 3 1 
end of the cpIII gene fragment and of the Nhe I site 
3 1 to the amplified cpIII fragment. The reaction and 
purification of the PCR product was performed as 
described above. The sequence of the resultant PCR 
modified cpIII DNA fragment having a 5 1 Nhe I and 3 1 
EcoR I restriction site is listed in the Sequence 
Listing as SEQ ID NO 41. 

An alternative Lac-B primer for use in 
constructing the cpIII membrane anchor and LacZ 
promotor region was Lac-B 1 as shown in Table 3. The 
amplification reactions were performed as described 
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above with the exception that in the second PGR 
amplification, Lac-B* was used with Lac-F instead of 
Lac-B. The product from the amplification reaction is 
listed in the sequence listing as SEQ ID NO 41 from 
nucleotide position 1 to nucleotide position 172 • The 
use of Lac-B 1 resulted in a LacZ region lacking 29 
nucleotides on the 3* end but was functionally 
equivalent to the longer fragment produced with the 
Lac-F and Lac— B primers. 

The products of the first and second PCR 
amplifications using the primer pairs G-3(F) and 
G-3(B) and Lac-F and Lac-B were then recombined at the 
nucleotides corresponding to cpIII membrane anchor 
overlap and Nhe I restriction site and subjected to a 
second round of PCR using the G-3(F) (SEQ ID NO 35) 
and Lac-B (SEQ ID NO 38) primer pair to form a 
recombined PCR DNA fragment product consisting of the 
following: a 5 1 Spe I restriction site; a cpIII DNA 
membrane anchor domain beginning at the nucleotide 
residue sequence which corresponds to the amino acid 
residue 198 of the entire mature cpIII protein; an 
endogenous stop site provided by the membrane anchor 
at amino acid residue number 112; a Nhe I restriction 
site, a LacZ promoter, operator and Cap-binding site 
sequence; and a 3» EcoR I restriction site* 

To construct a phagemid vector for the coordinate 
expression of a heavy chain-cpIII fusion protein as 
prepared in Example 2 with kappa light chain, the 
recombined PCR modified cpIII membrane anchor domain 
DNA fragment was then restriction digested with Spe I 
and EcoR I to produce a DNA fragment for directional 
ligation into a similarly digested pComb2 phagemid 
expression vector having only one Spe I site prepared 
in Example la4) to form a pComb2-III (also referred to 
as pComb2-III) phagemid expression vector. Thus, the 
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resultant ampicillin resistance conferring pComb2-3 
vector , having only one Spe I restriction site, 
contained separate LacZ promoter/ operator sequences 
for directing the separate expression of the heavy 
5 chain (Fd)-cpIII fusion product and the light chain 

protein. The expressed proteins were directed to the 
periplastic space by pelB leader sequences for 
functional assembly on the membrane. Inclusion of the 
phage Fl intergenic region in the vector allowed for 

10 packaging of single stranded phagemid with the aid of 
helper phage. The use of helper phage superinfection 
lead to expression of two forms of cpIII. Thus, 
normal phage morphogenesis was perturbed by 
competition between the Fab-cpIII fusion and the 

15 native cpIII of the helper phage for incorporation 

into the virion for Fab-cpVIII fusions. In addition, 
also contemplated for use in this invention are 
vectors conferring chloramphenicol resistance and the 
like. 

20 A more preferred phagemid expression vector for 

use in this invention having additional restriction 
enzyme cloning sites, designated pComb-III' or 
pComb2~3 9 , was prepared as described above for 
pComb2-3 with the addition of a 51 base pair fragment 

25 from pBluescript as described by Short et al., Nuc. 
Acids Res . . 16:7583-7600 (1988) and commercially 
available from Stratagene . To prepare pComb2-3', 
pComb2-3 was first digested with Xho I and Spe I 
restriction enzymes to form a linearized pComb2-3. 

30 The vector pBluescript was digested with the same 

enzymes releasing a 51 base pair fragment containing 
the restriction enzyme sites Sal I, Acc I, Hinc II, 
Cla I, Hind III, EcoR V, Pst I, Sma I and BamH I. The 
51 base pair fragment was ligated into the linearized 

35 pComb2-3 vector via the cohesive Xho I and Spe I 



WO 94/07922 PCT/US93/09328 

■\ 75 -S' 

termini to form pComb2-3'. 

Table 3 . -.V 

seq : : ' ! " 

5 ID NO Primer . _ ; ■;; ;; 

(35) 1 G-3 (F) 5 1 GAGAC GACTAGTGGTCGCGGTGGCTCTCCATTC 

GTTTGTGAATATCAA 3» 

(36) 2 G-3 (B) 5 1 TTACT AGCTAGCATAATAACGGAATACCCAAAA 

GAACTGG 3* 

10 (37) 3 LAC-F 5' TATGCTAGCTAGTAA CACGACAGGTTTCCCGAC 

TGG 3 • 

(38) 4 LAC-B 5 f ACCGAGCT CGAATTC GTAATCATGGTC 3 1 

(39) 5 LAC-B' 5* AGCTGT TGAATTCG TGAAATTGTTATCCGCT 3 ■ 

F Forward Primer 
B Backward Primer 

1 From 5 9 to 3': Spe I restriction site sequence is 
single underlined; the overlapping sequence with the 

20 5' end of cpIII is double underlined 

2 From 5 f to 3 1 : Nhe I restriction site sequence is 
single underlined; the overlapping sequence with 3* 
end of cpIII is double underlined • 

3 From 5* to 3*: overlapping sequence with the 3* 
25 end of cpIII is double underlined; Nhe I restriction 

sequence begins with the nucleotide residue n G w at 
position 4 and extends 5 more residues = GCTAGC. 

4 EcoR I restriction site sequence is single 
underlined. 

30 5 Alternative backwards primer for amplifying LacZ; 

EcoR I restriction site sequence is single underlined. 
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2. 



Isolation of HIV-l-Specif ic Monoclonal Antibodies 
Produced from the Dicistronic Expression Vector. \ 1 > 
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pComb2-3 

In practicing this invention, the heavy (Fd 
consisting of V H and C^l) and light (kappa) chains (V L , 
C L ) of antibodies are first targeted to the periplasm 
5 of E. coli for the assembly of heterodimeric Fab 

molecules. In order to obtain expression of antibody 
Fab libraries on a phage surface, the nucleotide 
residue sequences encoding either the Fd or light 
chains must be operatively linked to the nucleotide 

10 residue sequence encoding a filamentous bacteriophage 
coat protein membrane anchor. A coat protein for use 
in this invention in providing a membrane anchor is 
III (cpIII or cp3) . In the Examples described 
herein, methods for operatively linking a nucleotide 

15 residue sequence encoding a Fd chain to a cpIII 

membrane anchor in a fusion protein of this invention 
are described. 

In a phagemid vector, a first and second cistron 
consisting of translatable DNA sequences are 

20 operatively linked to form a dicistronic DNA molecule. 

Each cistron in the dicistronic DNA molecule is linked 
to DNA expression control sequences for the coordinate 
expression of a fusion protein, Fd-cpIII, and a kappa 
light chain. 

25 The first cistron encodes a periplasmic secretion 

signal (pelB leader) operatively linked to the fusion 
protein, Fd-cpIII. The second cistron encodes a 
second pelB leader operatively linked to a kappa light 
chain. The presence of the pelB leader facilitates 

30 the coordinated but separate secretion of both the 
fusion protein and light chain from the bacterial 
cytoplasm into the periplasmic space. 

In this process, the phagemid expression vector 
carries an ampicillin selectable resistance marker 

35 gene (beta lactamase or bla) in addition to the 
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Fd-cpIII fusion and the kappa chain. The fl phage 
origin of replication facilitates the generation of 
single stranded phagemid. The isopropyl 
thiogalactopyranoside (IPTG) induced expression of a 
dicistronic message encoding the Fd-cpIII fusion (V H/ 
C H1 , cpIII) and the light chain (V L , C L ) leads to the 
formation of heavy and light chains. Each chain is 
delivered to the periplastic space by the pelB leader 
sequence, which is subsequently cleaved. The heavy 
chain is anchored in the membrane by the cpIII 
membrane anchor domain while the light chain is 
secreted into the periplasm. The heavy chain in the 
presence of light chain assembles to form Fab 
molecules. This same result can be achieved if, in 
the alternative, the light chain is anchored in the 
membrane via a light chain fusion protein having a 
membrane anchor and heavy chain is secreted via a pelB 
leader into the periplasm. 

With subsequent infection of E. coli with a 
helper phage, as the assembly of the filamentous 
bacteriophage progresses , the coat protein III is 
incorporated on the tail of the bacteriophage. 

a. Preparation of Lymphocyte RNA 

Five milliliters of bone marrow was removed 
by aspiration from HIV-1 asymptomatic seropositive 
individuals. Total cellular RNA was prepared from the 
bone marrow lymphocytes as described above using the 
RNA preparation methods described by Chomczynski et 
al., Anal Biochem. , 162:156-159 (1987) and using the 
RNA isolation kit (Stratagene) according to the 
manufacturer's instructions. Briefly, for immediate 
homogenization of the cells in the isolated bone 
marrow, 10 ml of a denaturing solution containing 3.0 
M guanidinium isothiocyanate containing 71 Ml of 
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beta-mercaptoethanol was admixed to the isolated bone 
marrow. One ml of sodium acetate at a concentration 
of 2 M at pH 4.0 was then admixed with the homogenized 
cells. One ml of phenol that had been previously 
5 saturated with HjO was also admixed to the denaturing 
solution containing the homogenized spleen. Two ml of 
a chloroform: isoamyl alcohol (24:1 v/v) mixture was 
added to this homogenate. The homogenate was mixed 
vigorously for ten seconds and maintained on ice for 

10 15 minutes. The homogenate was then transferred to a 
thick-walled 50 ml polypropylene centrifuged tube 
(Fisher Scientific Company, Pittsburgh, PA) • The 
solution was centrifuged at 10,000 x g for 20 minutes 
; at 4°C. The upper RNA-containing aqueous layer was 

15 transferred to a fresh 50 ml polypropylene centrifuge 
tube and mixed with an equal volume of isopropyl 
alcohol. This solution was maintained at -20°C for at 
least one hour to precipitate the RNA. The solution 
containing the precipitated RNA was centrifuged at 

20 10,000 x g for twenty minutes at 4°C. The pelleted 

total cellular RNA was collected and dissolved in 3 ml 
of the denaturing solution described above. Three ml 
of isopropyl alcohol was added to the re-suspended 
total cellular RNA and vigorously mixed. This 

25 solution was maintained at -20 °C for at least 1 hour 
to precipitate the RNA. The solution containing the 
precipitated RNA was centrifuged at 10,000 x g for ten 
minutes at 4°C. The pelleted RNA was washed once with 
a solution containing 75% ethanol. The pelleted RNA 

30 was dried under vacuum for 15 minutes and then 
re-suspended in dimethyl pyrocarbonate-treated 
(DEPC-H z O) H 2 0. 

Messenger RNA (mRNA) enriched for sequences 
containing long poly A tracts was prepared from the 

35 total cellular RNA using methods described in 
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Molecular Cloning: A Laboratory Manual , Maniatis et 
al., eds., Cold Spring Harbor, NY, (1982). Briefly, 
one half of the total RNA isolated from a single donor 
prepared as described above was resuspended in one ml 
5 of DEPC-HjO and maintained at 65 °C for five minutes. 
One ml of 2X high salt , loading buffer consisting of 
100 mM Tris-HCl, 1 M NaCl, 2.0 mM EDTA at pH 7.5, and 
0.2% SDS was admixed to the resuspended RNA and the 
mixture allowed to cool to room temperature. 

10 The total purified mRNA was then used in PGR 

amplification reactions as described in Example 2c. 
Alternatively, the mRNA was further purified to poly 
A+ RNA by the following procedure. The total MRNA was 
applied to an oligo-dT (Collaborative Research Type 2 

15 or Type 3) column that was previously prepared by 

washing the oligo-dT with a solution containing 0.1 M 
sodium hydroxide and 5 mM EDTA and then equilibrating 
the column with DEPC-H^O. The eluate was collected in 
a sterile polypropylene tube and reapplied to the same 

20 column after heating the eluate for 5 minutes at 65°C. 
The oligo-dT column was then washed with 2 ml of high, 
salt loading buffer consisting of 50 mM Tris-HCl at pH 
7.5, 500 mM sodium chloride, 1 mM EDTA at pH 7.5 and 
0.1% SDS. The oligo dT column was then washed with 2 

25 ml of IX medium salt buffer consisting of 50 mM 

Tris-HCl, pH 7.5, 100 mM, 1 mM EDTA and 0.1% SDS. The 
messenger RNA was eluted from the oligo-dT column with 
1 ml of buffer consisting of 10 MM Tris-HCl at pH 7.5, 
1 mM EDTA at pH 7.5, and 0.05% SDS. The messenger RNA 

30 was purified by extracting this solution with 

phenol/chloroform followed by a single extraction with 
100% chloroform. The messenger RNA was concentrated 
by ethanol precipitation and resuspended in DEPC HjO. 
The resultant purified mRNA contained a plurality 

35 of anti-HIV encoding V H and V L sequences for 
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b. Construction of a Combinatorial HIV-1 
Antibody friary 
5 1) Selection of Oligonucleotide Primers V "•;'//■ 

The nucleotide sequences encoding the ' > 
immunoglobulin protein CDR's are highly variable. 
However, there are several regions of conserved 
sequences that flank the V region domains of either 

10 the light or heavy chain, for instance, and that 

contain substantially conserved nucleotide sequences, 
i.e., sequences that will hybridize to the same primer 
sequence. Therefore, polynucleotide synthesis 
(amplification) primers that hybridize to the v <}%v>\$ 

15 conserved sequences and incorporate restriction sites 
into the DNA homo log produced that are suitable for 
operatively linking the synthesized DNA fragments to a .> 
vector were constructed. More specifically, the 
primers were designed so that the resulting DNA 

20 homologs produced can be inserted into an expression 
vector of this invention in reading frame with the 
upstream translatable DNA sequence at the region of 
the vector containing the directional ligation means. 
For amplification of the V H domains, primers were 

25 designed to introduce cohesive termini compatible with ^ 
directional ligation into the unique Xho I and Spe I 
sites of the pComb2-3 expression vector. In all 
cases, the 5» primers VHla (5 1 

CAGGTGCAGCTCGAGCAGTCT666 3» SEQ ID NO 42) and VH3a (5 9 
30 GAGGTGCAGCTCGAGGAGTCTGGG 3 1 SEQ ID NO 43) were 

designed to maximize homology with the V H l and V H 3 
subgroup families, respectively, although considerable 
cross-priming of other subgroups was expected. The 
Xho I restriction site for cloning into the pComb2-3 y- 
35 vector is underlined. The 3* primer CGI z having the 
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nucleotide sequence 5' GCATGTACTAGTTTTGTCACAAGATTTGGG 
3 9 (SEQ ID NO 44) used in conjunction with the 5* 
primers is the primer for the heavy chain 
corresponding to part of the hinge region* The Spe I 
site for cloning into the pComb2-3 vector is 
underlined. 

The nucleotide sequences encoding the V L domain 
are highly variable. However, there are several 
regions of conserved sequences that flank the V t 
domains including the J L , V L framework regions and V L 
leader /promotor. Therefore, amplification primers 
were constructed that hybridized to the conserved 
sequences and incorporate restriction sites that allow 
cloning the amplified fragments into the pComb2-3 
expression vector cut with Sac I and Xba I. 

For amplification of the kappa V L domains 
analogous to the heavy chain primers listed above, the 
5 9 primers, VKla (5 9 GACAT CGAGCTCA CCCAGTCTCCA 3 1 SEQ 
ID NO 45) and VK3a (5 9 GAAAT TGAGCTCA CGCAGTCTCCA 3 ' SEQ 
ID NO 46) , were used. These primers also introduced a 
Sac I restriction eridonuclease site indicated by the 
underlined nucleotides to allow the V L DNA homo log to 
be cloned into the pComb2-3 expression vector. The 3 1 
V t amplification primer, CRla having a nucleotide 
sequence 5 9 

GCGCC GTCTAGAA CTAACACTCTCCCCTGTTGAAGOTCTCT 
3 9 (SEQ ID NO 47) corresponding to the 3 9 end of the 
light chain was used to amplify the light chain while 
incorporating the underlined Xba I restriction 
endonuclease site required to insert the V L DNA 
homolog into the pComb2-3 expression vector. 

All primers and synthetic polynucleotides 
described herein, were either purchased from Research 
Genetics in Huntsville, Alabama or synthesized on an 
Applied Biosystems DNA synthesizer, model 381A, using 
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the manufacturer's instruction. 

2) PCR Amplification of V n and V L DNA 
Homoloas 

5 In preparation for PCR amplification, 

mRNA prepared above was used as a template for cDNA 
synthesis by a primer extension reaction. First, 
20-50 tig of total mRNA in water was first hybridized 
(annealed) at 70°C for 10 minutes with 600 ng (60.0 

10 pmol) of either the heavy or light chain 3* primers 

listed above. Subsequently, the hybridized admixture 
was used in a typical 50 pi reverse transcription 
reaction containing 200 jiM each of dATP, dCTP, dGTP 
and dTTP, 40 mM Tris-HCl at pH 8.0, 8 mM MgCl 2 , 50 mM 

15 NaCl, 2 mM spermidine and 600 units of reverse 
transcriptase (Superscript, BRL) . The reaction 
admixture was then maintained for one hour at 37 °C to 
form an RNA-cDNA admixture. 

Three /il of the resultant RNA-cDNA admixture was 

20* - then used in PCR amplification in a reaction volume of 
100 fil containing a mixture of all four dNTPs at a 
concentration of 60 /iM, 50 mM KCl, 10 mM Tris-HCl at 
pH 8.3, 15 mM MgCl 2 , 0.1% gelatin and 5 units of 
Thermus aouaticus (Taq) DNA polymerase (Perkin-Elmer- 

25 Cetus, Emeryville, California), and 60 pmol of the 
appropriate 5» and 3 1 primers listed above. The 
separate reaction admixtures were overlaid with 
mineral oil and subjected to 35 cycles of 
amplification. Each amplification cycle included 

30 denaturation at 91°C for 1 minute, annealing at 52 °C 
for 2 minutes and polynucleotide synthesis by primer 
extension (elongation) at 72°C for 1.5 minutes, 
followed by a final maintenance period of 10 minutes 
at 72°C. An aliquot of the reaction admixtures were 

35 then separately electrophoresed on a 2% agarose gel. 
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After successful amplification as determined by gel 
electrophoretic migration, the remainder of the 
RNA-cDNA was amplified after which the PGR products of 
a common 3' primer were pooled into separate V H -and 
5 V L -coding DNA homo log-containing samples and were then 
extracted twice with phenol/ chloroform, once with 
chloroform , ethanol precipitated and were stored at 
-70°C in 10 mM Tris-HCl at pH 7.5, and 1 mM EDTA. . 

10 3) Insertion of V n and V L -Codincr DNA 

Homoloos into pComb2-3 Expression 
Vector 

The V H -coding DNA homologs (heavy 
chain) prepared above were then digested with an 

15 excess of Xho I and Spe I for subsequent ligation into 
a similarly digested and linearized pComb2-3 in a 
total volume of 150 Ml with 10 units of ligase at 16 °C 
overnight* The construction of the library was 
performed as described by Burton et al. f Proc. Natl* 

20 Acad. Sci.. USA. 88:10134-10137 (1991). Briefly, 
following ligation, the pComb2-3 vector containing 
heavy chain DNA was then transformed by 
electroporation into 300 pi of XLl-Blue cells. After 
transformation and culturing, library size was 

25 determined by plating aliguots of the culture. 

Typically the library had about 10 7 members. An 
overnight culture was then prepared from which 
phagemid DNA containing the heavy chain library was 
prepared. 

30 For the cloning of the V L -coding DNA homologs 

(light chain) r 10 jug of phagemid DNA containing the 
heavy chain library was then digested with Sac I and 
Sbal. The resulting linearized vector was treated 
with phosphatase and purified by agarose gel 

35 electrophoresis. The desired fragment, 4.7 kb in 
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length, was excised from the gel. Ligation of this 
vector with prepared light chain PGR DNA proceeded as 
described above for heavy chain. A library of 
approximately 10 7 members having heavy chain fragments 
5 operatively linked to the cpIII anchor sequence 
(Fd-cpIII) and light chain fragments was thus 
produced. 

4) Preparation of Phage Expressing Fab 

10 He t erod imer s 

Following transformation of the 
resultant library produced above into XLl-Blue cells, 
phage were prepared to allow for isolation of HIV-1 
specific Fabs by panning on target antigens. To 

15 isolate phage on which heterodimer expression has been 
induced, 3 ml of SOC medium (SOC was prepared by 
admixture of 20 g bacto-tryptone, 5 g yeast extract 
and 0.5 g NaCl in one liter of water, adjusting the pH 
to 7.5 and admixing 20 ml of glucose just before use 

20 to induce the expression of the Fd-cpIII and light 
chain heterodimer) was admixed and the culture was 
shaken at 220 rpm for one hour at 37 °C, after which 
time 10 ml of SB (SB was prepared by admixing 30 g 
tryptone, 20 g yeast extract, and 10 g Mops buffer per 

25 liter with pH adjusted to 7) containing 20 pg/ml 
carbenicillin and 10 fig/ml tetracycline and the 
admixture was shaken at 300 rpm for an additional 
hour. This resultant admixture was admixed to 100 ml 
SB containing 50 pg/ml carbenicillin and 10 fig/ml 

30 tetracycline and shaken for one hour, after which time 
helper phage VCSM13 (10 12 pfu) were admixed and the 
admixture was shaken for an additional two hours. 
After this time, 70 kanamycin was admixed and 

maintained at 30°C overnight. The lower temperature 

35 resulted in better heterodimer incorporation on the 
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surface of the phage. The supernatant was cleared by 
centrifugation (4000 rpxn for 15 minutes in a JA10 
rotor at 4°C) . Phage were precipitated by admixture 
of 4% (w/v) polyethylene glycol 8000 and 3% (w/v) NaCl 
5 and maintained on ice for 30 minutes, followed by 
centrifugation (9000 rpm for 20 minutes in a JA10 
rotor at 4°C) . Phage pellets were resuspended in 2 ml 
of PBS and microcentrif uged for three minutes to 
pellet debris, transferred to fresh tubes and stored 

10 at -20 °C for subsequent screening as described below* 
For determining the titering colony forming units 
(cfu) , phage (packaged phagemid) were diluted in SB 
and 1 pi was used to infect 50 fil of fresh (OD600 = 1) 
XLl-Blue cells grown in SB containing ICT/ig/ml : ; :y \-iV 

15 tetracycline. Phage and cells were maintained at room 
temperature for 15 minutes and then directly plated on 
LB/carbenicillin plates. ; 

5) Selection of Anti-HIV-1 Heterodimers on 
20 phage Surfaces 

(a) Multiple Panninos of the Phag e 
Library 

The phage library produced in 
Example 2b4) was panned against recombinant gpl20 of , , 

25 HIV-1 strain Hlb as described herein on coated 
microtiter plate to select for anti-gpl20 
heterodimers. A second phage library was panned 
against recombinant gp41 (American Biotechnologies, 
Boston, MA) as described below to select for anti-gp41 

30 heterodimers. 

The panning procedure used was a modification of 
that originally described by Parmley and Smith 
(Parmley et al., Gene , 73:305-318 (1988). Pour rounds 
of panning were performed to enrich for specific 

35 antigen-binding clones. For this procedure, four 
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wells of a microtiter plate (Cos tar 3690) were coated 
overnight at 4°C with 25 Ml of 40 ng/ml gpl20 or gp41 
(American Biotechnologies) prepared above in 0.1 M 
bicarbonate, pH 8.6. The wells were washed twice with 
5 water and blocked by completely filling the well with 
3% (w/v) BSA in PBS and maintaining the plate at 37 °C 
for one hour. After the blocking solution was shaken 
out, 50 /il of the phage library prepared above 
(typically 10 11 cfu) were admixed to each well, and 

10 the plate was maintained for two hours at 37 °C. 

Phage were removed and the plate was washed once 
with water. Each well was then washed ten times with 
TBS/Tween (50 mM Tris-HCl at pH 7.5, 150 mM NaCl, 0.5% 
Tween 20) over a period of one hour at room 

15 temperature where the washing consisted of pipetting 
up and down to wash the well, each time allowing the 
. well to remain completely filled with TBS/Tween 

between washings. The plate was washed once more with 
distilled water and adherent phage were eluted by the 

20 addition of 50 m! of elution buffer (0.1 M HCl, 

' adjusted to pH 2.2 with solid glycine, containing 1 
mg/ml BSA) to each well followed by maintenance at 
room temperature for 10 minutes. The elution buffer 
was pipetted up and down several times, removed, and 

25 neutralized with 3 fil of 2 M Tris base per 50 Ml of 
elution buffer used. 

Eluted phage were used to infect 2 ml of fresh 
( OD 600 " • E. coli XLl-Blue cells for 15 minutes at 
room temperature, after which time 10 ml of SB 

30 containing 20 Mg/ml carbenicillin and 10 fig/ml 

tetracycline was admixed. Aliguots of 20, 10, and 
1/10 Ml were removed from the culture for plating to 
determine the number of phage (packaged phagemids) 
that were eluted from the plate. The culture was 

35 : shaken for one hour at 37 °C, after which it was added 
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to 100 ml of SB containing 50 fig/ml carbenicillin and 
10 ftg/rnl tetracycline and shaken for one hour. Helper 
phage VCSM13 (10 12 pfu) were then added and the 
culture was shaken for an additional two hours. After 
5 this time, 70 pg/ml kanamycin was added and the 
culture was " incubated at 37 °C overnight. Phage 
preparation and further panning were repeated as 
described above. 

Following each round of panning, the percentage 

10 yield of phage were determined, where % yield - 

(number of phage eluted/ number of phage applied) X 
100. The initial phage input ratio was determined by 
titering on selective plates to be approximately 10 11 
cfu for each round of panning. The final phage output 

15 ratio was determined by infecting two ml of 

logarithmic phase XLl-Blue cells as described above 
and plating aliguots on selective plates. In the 
first panning for gpl 20 -reactive phage, 4.6 X 10 11 
phage were applied to four wells and 7.7 X 10 s phage 

20 were eluted. After the fourth panning 1.0 X 10 s phage 
were eluted. From this procedure, 20 clones were 
selected from the Fab library for their ability to 
bind to glycosylated recombinant gpl20 from the IIIB 
strain of HIV-1. Five clones were selected from the 

25 Fab library specific for binding to gp41. The panned 
phage surface libraries were then converted into ones 
expressing soluble Fab fragments for further screening 
by ELISA as described below. 

In addition to panning on gpl20 of strain IIIB 

30 and gp41, also contemplated as antigens for panning of 
combinatorial libraries is recombinant gpl20 (IIIB 
strain) produced in baculovirus and recombinant gpl20 
(SF2 strain) produced in Chinese Hamster Ovary cells 
obtained as described by Steimer et al., Science, 

35 254:105-108 (1991). Another antigen, a synthetic 
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cyclic peptide, N— CH- ( CHj) 3 C0 [ SISGPGRAFYTG ] NCHgCO-Cys- 
NHj (SEQ ID NO 48) prepared as described by 
Satterthwait et al., Bulletin of the World Health 
Organization , 68: Suppl., 17-25 (1990) corresponding 
to the central most conserved part of the V3 loop of 
gp!20 was coupled to maleimide-activated BSA. The 
library was panned using 1, 2 or 4 ELISA wells coated 
with 1 Mg of protein antigen or 10 pg BSA-peptide per 
well. Four rounds of panning were carried out for 
each antigen as described above. Eluted phage from 
the final round were used to infect XLl-Blue cells. 
Four rounds of panning against the four antigens 
produced an amplification in eluted phage of between 
100 and 1000 fold. The panned phage surface libraries 
were then converted into ones expressing soluble Fab 
fragments for further screening by ELISA as described 
below. 

6) Preparation of Soluble Heterodimers and 
Characterization of Binding Specificity 
to HTV-1 Antigens V 
In order to further characterize the 
specificity of the mutagenized heterodimers expressed 
on the surface of phage as described above, soluble 
Fab heterodimers from acid eluted phage were prepared 
and analyzed in ELISA assays on HIV-1 derived 
antigen-coated plates and by competitive ELISA. 

To prepare soluble heterodimers, phagemid DNA 
from the 20 gp!20 positive clones and the 5 gp41 
positive clones prepared above was isolated and 
digested with Spe I and Nhe I. Digestion with these 
enzymes produced compatible cohesive ends. The 4.7 kb 
DNA fragment lacking the gene III portion was 
gel-purif ied (0.6% agarose) and self-ligated. 
Transformation of E. coli XLl-Blue afforded the 
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isolation of recombinants lacking the cpIII fragment. 
Clones were examined for removal of the cpIII fragment 
by Xho I - Xba I digestion, which should yield an 
1.6-kb fragment. Clones were grown in 100 ml SB 
5 containing 50 MS/ml carbenicillin and 20 mM MgCl 2 at 
37°c until an OD^ of 0.2 was achieved. IPTG (1 niM) 
was added and the culture grown overnight at 30°C 
(growth at 37 °C provides only a light reduction in 
heterodimer yield). Cells were pelleted by 

10 centrifugation at 4000 rpm for 15 minutes in a JA10 
rotor at 4°C. Cells were resuspended in 4 ml PBS 
containing 34 pg/ml phenylmethylsulfonyl fluoride 
(PMSF) and lysed by sonication on ice (2-4 minutes at 
50% duty) . Debris was pelleted by centrifugation at 

15 14,000 rpm in a JA20 rotor at 4°C for 15 minutes. The 
supernatant was used directly for ELISA analysis as 
described below and was stored at -20 °C. For the 
study of a large number of clones, 10 ml cultures 
provided sufficient heterodimer for analysis. In this 

20 case, sonications were performed in 2 ml of buffer. 

Assays as described above were also performed for 
the gp4i-specif ic clones. 

a) ficrtt«ninq by *T-Tfi* 

25 The soluble heterodimer s prepared 

above were assayed by ELISA. For this assay, gpl20 
and gp41 were separately admixed to individual wells 
of a microtiter plate as described above for the 
panning procedure and maintained at 4°C overnight to 

30 allow the protein solution to adhere to the walls of 
the well. After the maintenance period, the wells 
were washed five times with water and thereafter 
maintained for one hour at 37 °C with 100 /il solution 
of 1% BSA diluted in PBS to block nonspecific sites on 

35 the wells. Afterwards, the plates were inverted and 
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shaken to remove the BSA solution. Twenty-five nl of 
soluble heterodimers prepared above reactive with the ; ■ 
specific glycoprotein substrate were then admixed to 
each well and maintained at 37°C for one hour to form \]\\ 
5 immunoreaction products. Following the maintenance 

period, the wells were washed ten times with water to 
remove unbound soluble antibody and then maintained 
with a 25 /il of a 1:1000 dilution of secondary goat 
anti-human IgG F(ab') 2 conjugated to alkaline 

10 phosphatase diluted in PBS containing 1% BSA, The 

wells were maintained at 37°C for one hour after which 
the wells were washed ten times with water followed by 
development with 50 pi of p-nitrophenyl phosphate 
(PNPP) . Color development was monitored at 405 ran. 

15 Positive clones gave A405 values of >1 (mostly >1.5) ;> 
after 10 minutes, whereas negative clones gave values 
of 0.1 to 0.2. 

Approximate concentrations of gpl20-reactive Fab 
were determined by ELXSA using a sandwich ELISA as 

20 described by Zebedee et al.. Proc. Natl. Acad. Sci.. 
USA . 89:3175-3179 (1992) and are presented in the 
first column of Figure 6. In addition, since Fabs are 
expressed in E. coli and the fraction of correctly 
assemble protein can vary, the amount of Fab reacting 

25 with gpl20 was also assessed by ELISA titration. That 
data is also presented in Figure 6 in the second 
column. /;>: '* 

For the clones panned against the HIV-1 derived 
antigens, after conversion of the panned phage surface 

30 libraries to ones expressing soluble Fab fragments, 
30-40 colonies were used to transform XLl-Blue cells 
and the supernates screened in ELISA assays against V 
the antigen used in panning. Generally greater than 
80% of the supernates tested positive. A 

35 representative number of positives were then selected 



from each antigen panning for further analysis. 



(b) Competitiw** ELISA with Soluble 
OP120 and CD4 

Timnunoreactive heterodimers as 
determined in the above ELISA were then analyzed by 
competition ELISA to determine the affinity of the 
selected heterodimers. The ELISA was performed as 
described above on microtiter veils separately coated 
with 5 Mg/nl of gpl20 or soluble CD4 (American 
Biotechnologies) in 0.1 H bicarbonate buffer at pH 
8.6. Increasing concentrations of soluble or free 
gpl20 ranging in concentration from 10" 11 M up to 10" 7 M 
diluted in 0.5% BSA/0. 025% Tween 20 /PBS were admixed 
with soluble heterodimers, the dilutions of which were 
determined in titration experiments that resulted in 
substantial reduction of OD values after a 2 -fold 
dilution. For the CD4 competition assays, increasing 
concentrations of soluble or free CD4 ranging in 
concentration from 10" 11 M up to 10" 6 M diluted in 0.5% 
BSA/0. 025% Tween 20 /PBS were admixed with soluble 
heterodimers. The plates were maintained for 90-120 
minutes at 37 °C and carefully washed ten times with 
0.05% Tween 20 /PBS before admixture of alkaline 
phosphatase-labelled goat anti-human IgG F(ab')2 at a 
dilution of 1:500 followed by maintenance for 1 hour 
at 37°C. Development was performed as described for 
ELISA. 

To establish the relationship between 
neutralizing ability as described in Example 3 below 
could be related to antigen binding affinity of 
HTV-l-specific Fabs, competition ELISAs were carried 
out where soluble gpl20 was competed with gpl20 coated 
on ELISA plates for Fab binding. Figure 7 shows that 
all Fabs were competed from binding to gpl20 with a 
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ICjq of approximately 10" 9 M free gpl20. In addition 
as shown in Example 3, there is no correlation between 
antigen affinity and neutralization. The Fabs tested 
included Fabs 4, 12, 21 and 7 that are members of the 
same groups as determined by sequence analysis and 
comparison as described in Example 4. Fabs 13, 27, 6, 
29, 2 and 3 are all members of the different groups as 
determined by sequence analysis and comparison as 
described in Example 4. Loop 2 is an Fab fragment 
selected from the same library as the other Fabs but 
which recognizes the V3 loop. Only with the V3 loop 
peptide was competition carried out with gpl20 from 
the SF2 strain. 

To investigate whether neutralization could be 
associated with blocking of the gpl20-CD4 interaction, 
competition ELISAs were carried out with soluble CD4 
competing with Fabs for binding to gpl20-coated ELISA 
wells. The results are shown in Figure 8. P4D10 and 
loop 2 are controls not expected to be competed by 
CD4. P4D10 is a mouse monoclonal antibody reacting 
with the V3 loop of gp!20 (IIIB) . Loop 2 Fab 
competition was carried out using gpl20 (SF2) . As 
shown in Figure 8 the binding of all Fabs with the 
exception of the controls was inhibited with an IC^ 
of approximately 10" 8 M of soluble CD4. In addition, 
no difference was detected between the neutralizing 
and non-neutralizing Fabs to gpl20 inhibited by CD4. 
This implies that blocking of the CD4-gpl20 
interaction is unlikely to be an important factor in 
Fab neutralization of the HIV-1 virus. 

Similar competition assays were performed with 
the Fabs panned against the four HIV-1 derived 
antigens. The 19 Fabs derived from panning against 
gp!20 (IIIB) showed apparent affinities 
(1/concentration at 50% inhibition) for gpl20 (IIIB) 
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in the range 10 7 - 1(T 9 M with most being 1-3 X 1(T 8 M. 
The panning procedure tends to select strongly for 
tight binders so a grouping into a relatively narrow 
band of affinities was expected. Of 16 Fabs derived 
5 from panning against gp!60 (IIIB), 6 were also 

reactive with gpl20 (IIIB) and competition ELISAs 
showed they had similar apparent affinities as the 
gpl20-panned Fabs. The non-gpl20 reactive clones from 
the gpl60 panning showed a lower ELISA reactivity with 

10 gp!60 and could not be satisfactorily competed with 

gpl60. They may be directed against gp41 but were not 
pursued here. Eight Fabs derived from panning against 
gpl20 (SF2) also showed strong ELISA reactivity with 
gp!20 (IIIB) and gave similar apparent binding 

15 affinities. Four Fabs were derived from panning 

against the V3 loop peptide. Of these Fabs, 2 reacted 
in ELISA with gpl20 (SF2) but none with gpl20 (IIIB) ♦ 
The apparent binding affinity of these loop binders to 
gpl20 (SF2) was 10* 8 M. 

20 To complete the survey in terms of strain 

cross-reactivity of Fabs, those derived from the gpl20 
and gpl60 (IIIB) pannings were examined for ELISA 
reactivity with gpl20 (SF2) • All were reactive. 
Therefore, all the Fabs examined, with the exception 

25 of those selected by panning against the V3 loop 
peptide, bound to gpl20 from IIIB and SF2 strains. 

The Fabs were screened f or CD4 inhibition of 
their binding to gpl20 (IIIB) immobilized on ELISA 
wells. All, again with the exception of the V3 loop 

30 binders, showed sensitivity to CD4 inhibition. The 
inhibition constants were in the range 10* 7 to 10' 9 M. 

Also contemplated is a competition ELISA assay 
where the binding of HIV-l recombinant Fabs of this 
invention is performed in the presence of excess Fabs 

35 of this invention as well as those HIV-l antibodies, 
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polyclonal or monoclonal, present: in patient sera, 
either asymptomatic or symptomatic, or obtained by 
other means such as EBV transformation and the like. 
The ability of an exogenous ly admixed antibody to 
5 compete for the binding of a characterized Fab of this 
invention will allow for the determination of 
equivalent antibodies in addition to unique epitopes 
and binding specificities. 

10 3. Neutralizing Activity of Rec ran*"T»« nt Human Fab 

Fragments Against HIV-1 In Vitro 
Binding of antibodies to viruses can result in 
loss of infectivity or neutralization and, although 
not the only defense mechanism against viruses, it is 

15 widely accepted that antibodies have an important role 
to play. However, understanding of the molecular 
principles underlying antibody neutralization is 
limited and lags behind that of the other effector 
functions of antibody* Such understanding is required 

20 for the rational design of vaccines and for the most 
effective use of passive antibody for prophylaxis or 
therapy. This is particularly urgent for the human 
immunodeficiency viruses. 

A number of studies have led to the general 

25 conclusion that viruses are neutralized by more than 
one mechanism and the one employed will depend on 
factors such as the nature of the virus, the epitope 
recognized, the isotype of the antibody, the. cell 
receptor used for viral entry and the virus : antibody 

30 ratio. The principle mechanisms of neutralization can 
be considered as aggregation of virions, inhibition of 
attachment of virus to cell receptor and inhibition of 
events following attachment such as fusion of viral 
and cellular membranes and secondary uncoating of the 

35 virion. One of the important features of the third 



mechanism is that it may require far less than the 
approximately stoichiometric amounts of antibody 
expected for the first two mechanisms since occupation 
of a small number of critical sites on the virion may 
be sufficient for neutralization. For instance it has 
been shown that neutralization of the influenza A 
virion obeys single hit kinetics as described by 
Outlaw et al., Epidemiol. Infect. . 106:205-220 (1992). 

Intensive studies have been carried out on 
antibody neutralization of HIV-1. For review, see 
Nara et al., FASEB J. . 5:2437-2455 (1991). Most have 
f ocussed on a single linear epitope in the third 
hypervariable domain of the viral envelope 
glycoprotein gpl20 known as the V3 loop, i; Antibodies v 
to this loop are suggested to neutralize by inhibiting 
fusion of viral and cell membranes. Binding to the 
loop resulting in neutralization can occur prior to : ; 
virus-cell interaction or following gpl20 binding to 
CD4. See, Nara, In Retroviruses of Human Aids and 
Related Animal Diseases, eds. Girard et al., pp. 
3 38-150 (1988); Linsely et al. , J. Virol. . 
62:3695-3702 (1988); and Skinner et al., J. Virol. . 
67:4195-4200 (1988). Features of the V3 loop are 
sequence variability within the loop [Goudsmit et al., 
FASEB J. . 5:2427-2436 (1991) and Albert et al., AIDS, 
4:107-112 (1990)] and sensitivity of neutralizing 
antibodies against the loop to sequence variations 
outside the loop [Nara et al., FASEB J. . 5:2437-2455 
(1991); Albert et al., supra ; McKeating et al., AIDS . 
3:777-784 (1989); and Wahlberg et al., AIDS Res, Hum. 
Retroviruses . 7:983-990 (1991). Hence anti-V3 loop 
antibodies are often strain specific and mutations in 
the loop in vivo may provide a mechanism for viral 
escape from antibody neutralization. 

Recently considerable interest has focused on 
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antibodies capable of blocking CD4 binding to gp!20. 
A number of groups have described the features of 
these antibodies as (a) reacting with conformational 
i.e., non-linear epitopes, (b) reacting with a wide 
range of virus isolates and (c) being the predominant 
neutralizing antibodies in humans after longer periods 
of infection. See, Berkower,et al., J. Virol. r 
65:5983-5990 (1991); Steimer et al.. Science . 
254:105-108 (1991); Ho et al., J. Virol, , 65:489-493 
(1991); Kang et al., Proc. Natl. Acad, Sci. . USA . 
88:6171-6175 (1991); Posner et al., J. Immunol.. 
146:4325-4332 (1991); and Tilley et al., Res. Virol. . 
142:247-259 (1991). Neutralizing antibodies of this 
type would appear to present a promising target for 
potential therapeutics. The mechanism (s) of 
neutralization of these antibodies is unknown although 
there is some indication that this may not be blocking 
of virus attachment since a number of mouse monoclonal 
antibodies inhibiting CD4 binding to gpl20 are either 
non-neutralizing or only weakly neutralizing. 

The generation of human monoclonal antibodies 
against the envelope of HIV-l as described by Burton 
et al. . Proc. Natl. Acad. Sci. . USA . 88:10134-10137 
(1991) using combinatorial libraries allows a novel 
approach to the problem of neutralization. Given the 
lack of a three-dimensional structure for gpl20 and 
the complexity of the virus, the approach seeks to 
explore neutralization at the molecular level through 
the behavior of related antibodies. This is possible 
for the following reasons: (l) the combinatorial 
approach allows the rapid generation of large numbers 
of human antibodies; (2) the antibodies (Fab 
fragments) are expressed in E.coli and can readily be 
sequenced; and (3) antibodies have similar sequences 
and common structural motifs allowing functional 
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differences to be meaningfully correlated with primary 
structure. 

Neutralization studies were performed as 
described herein on the human recombinant Fab 
5 fragments from 20 clones against gpl20 prepared as 

described in Examples l and 2, all of which are strain 
cross-reactive and inhibited by CD4 from binding to 
gpl20. The results presented herein show that 
neutralization was not effected by virus aggregation 

10 or cross-linking of gp!20 molecules on the virion 

surface and was not correlated with blocking of the 
interaction between soluble CD4 and recombinant gpl20. 

Neutralization studies were also performed as 
described herein on the human recombinant Fab 

15 fragments from the gp41-reactive clones prepared as 
described in Examples 1 and 2* The results are 
presented below. 

Two different assays, a p24 ELISA assay and a 
syncytium assay, were performed to measure 

20* neutralization ability of the recombinant human HIV-1 
immunoreactive Fabs. An additional assay/ a plaque 
assay, was performed for determining the 
neutralization effectiveness of the gp41-reactive 
Fabs. In plaque assays, CD 4+ cells were cultured int 

25 he presence or absence of soluble gp4 1-react ive Fabs 
prior to inoculation with virus. Inhibition of 
infectivity, also referred to as neutralization, by 
antibodies was expressed as the percent of plaque 
formation in the cultures compared to cells exposed to 

30 PBS alone. 

For some of these assays, the recombinant Fabs 
were first purified. One liter cultures of SB 
containing 50 fig/ml carbenicillin and 20 mM MgCl 2 were 
inoculated with appropriate clones and induced 7 hours 

35 later with 2*mM IPTG and grown overnight at 30 °C. The 
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cell pellets were sonicated and the resultant 
supernatant were concentrated to a 50 ml volume. The 
filtered supernatants were loaded on a 25 ml protein 
G-anti-Fab column, washed with 120 ml buffer at a rate 
5 of 3 ml/minute and eluted with citric acid at pH 2.3. 
The neutralized fractions were then concentrated and 
exchanged into 50 mM MES at pH 6.0 and loaded onto a 2 
ml Mono-S column at a rate of 1 ml /minute. A gradient 
of 0-500 mM NaCl was run at 1 ml/minute with the Fab 

10 eluting in the range of 200-250 mM NaCl. After 

concentrating, the Fabs were positive when titered on 
ELISA against gp!20 and gave a single band at 50 JcD by 
10-15% SDS-PAGE. Concentration was determined by 
absorbance measurement at 280nm using an extinction 

15 coefficient (1 mg/ml) of 1.4. 

a. Neutralization as Measured by the p24 ELISA 
Assay 

For this assay, diluted tissue culture 

20 supernatants of HIV-1 IIIB or MN-inf ected peripheral 
blood mononuclear cells (PBMC) (5pTCID 5C (50% tissue 
culture infectious dose) , 100 Ml) were maintained for 
2 hours at 37 °C with serial dilutions (1:2), beginning 
at a dilution of 1:20, of recombinant Fab supernates 

25 prepared in Example 2b6) • Control Fab supernates were 
also provided that included human neutralizing sera, a 
known human neutralizing monoclonal antibody 2F5 and 
the Fab fragment derived from that antibody by papain 
digestion, and a known mouse neutralizing monoclonal 

30 antibody and its F(ab f ) 2 fragment as described by 

Broliden et al., J. Virol. . 64:936-940 (1990). PBMC 
(1 x 10 s cells ) were admixed to the virus /antibody 
admixture and maintained for 1 hour at 37 °C 
Thereafter, the cells were washed and maintained in 

35 RPMI 1640 medium (GIBCO) supplemented with 10% fetal 
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calf serum, 1% glutamine, antibiotics and IL-2. The 
culture medium was changed at days 1 and 4. At 7 days 
post-infection, supernates were collected and analyzed 
by HIV-1 p24 antigen capture ELISA as described by 
5 Sundgvist et al., J. Med. Virol. . 29:170-175 (1989) 
the disclosure of which is hereby incorporated by 
reference. Neutralization was defined as positive if 
an 80% or greater reduction of optical density at 
490nm in the culture supernatant occurred as compared 
10 to negative Fab or negative human serum. Tests with 
all Fabs, mAbs and sera were repeated on at least two 
occasions. 

b. Quantitative Infectivity Assay Based on 
15 Syncytial Formation 

A quantitative neutralization assay with the 
MN strain of HIV-1 was performed as described by Nara 
et al., AIDS Res. Human Retroviruses . 3:283-302 
(1987), the disclosure of which is hereby incorporated 

20 by reference. Monolayers of CEM-SS target cells were 
cultured with virus, in the presence or absence of 
antibody, and the number of syncytia forming units 
determined 3-5 days later. An equivalent amount of 
virus was used in the assays to allow direct 

25 comparison of the various antibody concentrations 
tested. The assays were repeatable over a virus- 
surviving fraction range of 1 to 0.001 within a 2 to 
4 -fold difference in the concentration of antibody 
(P<0.001) . 

30 

c. Results o f the Neutralization Assays for 

gpi2Q 

Assays were generally repeated at least 
twice with reproducible results. For the data 
35 reported in Figure 6, the gpi20-specif ic Fab 



supemates were divided into two parts , one being used 
in the p24 assay and the other in the syncytia assay. 
A dash (-) indicates that there was no neutralization 
at 1:20 dilution in the p24 assay and 1:16 in the 
syncytial assay (with most clones showing no 
detectable neutralization at a 1:4 dilution) . 
Neutralization titers are indicated in the figure. 
For the p24 assay, the titer corresponds to the 
greatest dilution producing >80% reduction in 
absorbance in ELISA. For the syncytia assay, Fabs 4 
and 12 produced >95% neutralization at a 1:4 dilution 
of supemate and 80 and 70% reduction at 1:128 
dilution respectively. These Fabs were effective 
neutralizers in both types of assays. They have also 
been shown to neutralize infection by IIIB and RF 
strains using a PCR-based assay of proviral 
integration. Fabs 6 and 7 showed no neutralization in 
the syncytia assay but other supemate preparations 
showed activity. Fab 13 was consistently effective in 
the p24 assay but not in the syncytia assay. A number 
of other clones show lower levels of neutralizing 
ability. 

Fabs were purified from a selection of some of 
the clones as described above and used in both 
neutralization assays. ' As shown in Figure 9, Fabs 4 
and 12 were again effective in both assays at similar 
levels with for example 50% inhibition of syncytial 
formation at an Fab concentration of approximately 20 
nM (1 fig/ml) . The results shown are derived from the 
syncytia assay using the MN strain. Fabs 7 and 21 
were equally effective in the syncytial assay but 
somewhat less so in the p24 assay. The p24 assay /, : J. 
indicated greater than 80% neutralization of HIV-l MN 
strain for Fab 4 at 3, Fab 7 at 15, Fab 12 at 3, Fab./; 
13 at 4 and Fab 21 at 7 iig/ml, respectively. Fab 13 
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however was ineffective in the syncytial assay at 25 
fig /ml. For the IIIB strain, greater than 80% 
neutralization was observed for Fab 4 at 13, Fab 7 at 
15 , Fab 12 at 7 and Fab 21 at 14 jig/ml, respectively. 
5 Although Fab 11 was not effective in neutralization 

assays when unpurified as shown in Figure 6, following 
purification, Fab 11 was equally effective as Fab 12 
in neutralizing HTV-1. For this reason, the Fab is 
being deposited with the ATCC as described in Example 

10 7 along with Fab 12 and Fab 13* 

There are a number of conclusions arising from 
the data shown in the Figures 6 and 9. It is apparent 
that HIV-l can be neutralized without virion 
aggregation or cross-linking of gpl20 molecules on the 

15 virion surface since monovalent Fab fragments are 
effective. To further confirm this finding, a Fab 
fragment was produced by papain digestion of a known 
neutralizing human monoclonal antibody. As shown in 
Figure 6, the Fab fragment was approximately equally 

20 effective as the whole IgG in neutralization of the MN 
strain of HTV-1. This is consistent with results on 
Fabs prepared from two mouse monoclonal antibodies to 
the V3 loop. An F(ab 9 ) 2 fragment of a mouse 
monoclonal antibody was somewhat less effective than 

25 the parent IgG in neutralization of the MN strain. 
Interestingly, the fragments from these control 
antibodies were relatively poor in neutralizing the 
IIIB strain of HIV-l. The results also show that 
there appears to be a difference between the two 

30 assays employed since Fab 13 was consistently 
effective in one assay but not the other. The 
principal variables were the incubation time of the 
virus and antibody prior to infection (2 hours for the 
p24 assay and 0.5 hours for the syncytial assay), the 

35 amount of virus used for infection, the cells used to 
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propagate virus (human PBMCs for the former and H9 
cells for the latter) and the cells infected (human 
PBMCs for the former and CEM.SS cells for the latter) . 
Of these, there is a strong possibility that the MN 
virus used in the two assays, having been passaged 
through different cells, is critically different. 

d. Results o f the Neutralization Assays for 
2Eil 

The gp41-reactive Fabs exhibited specificity 
to the conformation epitope of gp4l including amino 
acid residues in positions 565-585 and 644-663. The 
five selected gp41-specif ic Fabs were designated DL 41 
19, DO 41 11, GL 41 1, MT 41 12 and SS 41 8. 
Neutralization assays were performed as described 
above for the gpl20-reactive Fabs. In the plaque 
assays, the data shown is the concentration of Fab in 
micrograms /milliliter required to achieve 50% of 
neutralization. The data for the other two 
neutralization assays is also expressed in 
micrograms /milliliter of Fab required to neutralize 
infection as defined in the description of the p24 and 
syncytial assays above. The results of the three 
neutralization assays, plaque, syncytial and p24, are 
presented in Table 4. The MN and IIIB HIV strains 
were used as indicated in Table 4 for the assays. The 
abbreviation n ND n stands for not determined when 
indicated in the table. 

Table 4 
Assay /Strain 

Plaoue Syncytial £2£ 

Fab JOT IIIB IIIB MN IIIB 

DL 41 19 <4' <40 1.4 ND ND 
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2.3 0.9 ND 

1.7 ND 3.5 

5.5 4.5 4.5 

2.2 ND 7.1 

5 , .; . 

As shown in Table 4, all five Fabs were effective 
at neutralizing both MN and IIIB strains of HIV in 
either plaque , syncytial or p24 assays. Fabs DL 41 19 
and DO 41 11 exhibited strain specificity in the 

10 plaque assay where the former was ten-fold more 

effective at inhibiting plaque formation with the MN 
strain than with the IIIB strain. The opposite 
specificity was seen with the DO 41 11 Fab. However, 
both Fabs exhibited comparable neutralization as 

15 measured by the syncytial assay. Two Fabs, GL 41 1 
and SS 41 8, were equally effective at inhibiting 
plaque formation with either MN or IIIB strains. The 
Fab MT 41 12 was similarly not strain-specif ic 
although neutralization required 10 fold more 

20 antibody. No strain specificity was evident when Fab 
MT 41 12 was used .in p24 assay? where the .same amount 
of antibody was equally effective. All five 
antibodies were neutralized IIIB as measured in the 
syncytial assay. 

25 Thus, the five gp41-specif ic Fabs neutralized 

HIV-1 MN and IIIB in at least two of the three assays 
used for measuring neutralizing activity. Moreover, 
strain specificity was prevalent in two of the five 
assays as measured by the plaque assay. Based on 

30 these differential neutralization characteristics, the 
gp41~specif ic Fabs provide useful therapeutic reagents 
for neutralizing HIV-1. 

4. Nucleic Acid Sequence Analysis Comparison Between 
35 HIV-l Specific Monoclonal Antibody Fabs and the 



DO 41 11 <40 7.1 

GL 41 1 <4 <4 

MT 41 12 <40 <40 

SS 41 8 <4 <4 
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Corresponding Derived Amino Acid Residue Sequence 
To explore the relationship between neutralizing 
and weakly or non-neutralizing Fabs, the variable 
domains of 32 clones expressing human anti-gpl20 Fabs, 
5 prepared in Example 2 including the 20 listed in 

Figure 6 for which neutralizing activity was assessed, 
were sequenced. In addition, the five gp41-specif ic 
Fabs were also sequenced. 

Nucleic acid sequencing was performed on 
10 double-stranded DNA using Sequenase 1.0 (USB, 

Cleveland, OH) and the appropriate primers hybridizing 
to sequences in the Cgl domain (SEQGb .: 5 1 
GTCGTTGACCAGGCAGCCCAG 3 1 SEQ ID NO 49) or the Ck 
domain (SEQKb : 5* ATAGAAGTTGTTCAGCAGGCA 3« SEQ ID NO 
15 50) . Alternatively sequencing employed single 

stranded DNA and the T3 primer (5 1 ATTAACCCTCACTAAAG 
3 1 , SEQ ID NO 51) or one hybridizing to a sequence in 
the Ck domain (KEF : 5 1 GAATTCTAAACTAGCTAGTTCG 3» SEQ 
ID NO 52) . 

20 The amino acid residue sequences of the variable 

heavy and light chains derived from the nucleic extid 
sequences of the 32 gpl20-specif ic clones are shown 
respectively in Figures 10 and 11. Groupings are made 
on the basis of similarities in heavy chain sequences. 

25 Dots indicate identity with the first sequence in each 
section. The SEQ ID NOs are listed to the right of 
the corresponding derived heavy and light chain (V H 
from SEQ ID NO 53-81 and V L from SEQ ID NO 82-113) 
amino acid residue sequences in the Figures 

3 0 themse Ives • 

Alignment of derived sequences with one another 
and with the Genbank database made use of the 
MacVector suite of programs. For analysis of heavy 
chain CDR3 sequences as described by Sanz, 

35 Immunol . . 147:1720-1729 (1991), the most 5 f nucleotide 
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was considered to be the first nucleotide after codon 
95 of the H chain variable region according to Rabat 
et al, Sequences of Proteins of Immunological 
Interest, US Dept. of Health and Human Services, 
5 Washington, DC (1991). The most 3' nucleotide was 
assigned to the last unidentified nucleotide before 
the sequence matched with the published germ line JH 
genes. The CDR3 sequences were analyzed using the 
DNASTAR software. Sequence comparisons were performed 

10 with both the ALIGN and COMPARE programs in order to 
determine the germline D gene which provided the best 
homology throughout. In a second step, the SEQCOMP 
program was used to find sequence identity of at least 
six nucleotides with either the coding strand or the 

15 reverse complement of germline D genes. 

The heavy and light chain sequences of the gp41~ 
specific Fabs are shown in Figures 18 and 19, 
respectively. The amino acid residue sequence of the 
CDR3 heavy chain exhibits the most variation between 

20 the Fabs than any other region of the variable domain. 

a. Organization of Antibodies into Groups 
According to Heavy Chain Sequence 
V H and V L domains of 32 gpl20 clones were 
25 sequenced and the V H domains compared using MacVector 
software. This analysis immediately established that 
a number of the clones, including those selected by 
panning against different antigens, are closely 
related to one another. The exception to this is the 
30 Fabs selected by panning against the V3 loop peptide 
which are not related to the Fabs selected by panning 
against the gpl20/160 antigens. Figure 10 shows that 
the V H sequences derived from gpl2 0/160 panning can be 
organized into 7 groups. The broad features apparent 
35 from a comparison of amino acid sequences are 
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discussed herein. 

The relatedness of sequences within a group 
varies considerably. For instance, in the group 
beginning with clone number b8 the amino acid 
sequences are very similar. Six clones were identical 
and the remainder showed a maximum of 5 differences 
from the predominant sequence (the EQ difference due 
to the 5* primer excluded). Only one clone showed a 
single difference in the CDR3 region. The average 
discrepancy over all the sequences in this group from 
the predominant sequence is 1.1 amino acid residues/ 
variable domain. This amount corresponds to the order 
of magnitude of discrepancies which could arise from 
the PCR. Sequencing of constant domains indicated a 
PGR error frequency of about 1 base change per domain. 

In contrast, in the group headed by clone b3, no 
two clones were absolutely identical. The average 
difference from the consensus group sequence is 3.3 
residues per sequence and determination for the CDR3 
alone is 1.3. Therefore, it seems likely that the 
heavy chains iji this group are somatic variants of one 
another. 

The group headed by clone 1 presents a third 
pattern. Clones bl and bl4 are identical as are 
clones b2 and B2. However, 23 amino acid differences 
exist between the two sets of clones. Clones b24 and 
B30 are approximately equally well differentiated 
(13-25 differences) from either of these two sets of 
clones or one another. Still the CDR3 regions are 
very similar. A number of explanations can be 
suggested for this pattern: 1) all clones in this 
group originate from the same germ line gene which has 
undergone extensive somatic mutation, 2) cross-over 
events have occurred to essentially recombine 
different genu line genes with the same DJ combination, 
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3) a "convergent evolution" process has led to the 
selection of different germline genes associated with 
the sane DJ combination, 

b. Sequences of the V L Domains from the cml20 
Binders 

The V L sequences of the Fabs were organized 
into the groups defined in Figure 10 are shown in 
Figure 11. Immediately apparent was the extensive 
chain promiscuity as evidenced by the pairing of 
different light chains with the same or a very similar 
heavy chain with retention of antigen .binding 
capability and indeed, for the most part, antigen 
affinity as compared with Figure 10. This promiscuity 
can be explored further by reference to the groups 
considered above. 

The clone b8 group, in which the heavy chain 
members were identical or very similar, also produced 
4 light chains which are identical or very similar 
(less than 3 amino acid differences) . Therefore a 
predominant heavy-light chain combination can be 
described for this group. One member (clone b8) had 
the same or very closely related V L gene but appeared 
to use a different Jk gene. Two other members (clones 
B8 and bl8) were more distantly related to the major 
sequence (7-12 differences) . Two further clones (bl3 
and B26) used a Vk gene from a different family, Vk3 
compared to Vkl, and therefore were unrelated to the 
major sequence. 

The clone b3 group, suggested to contain somatic 
variants of a single heavy chain, showed considerable 
light chain diversity with no two members being 
closely related to one another. Vk3-Jk2 combinations 
predominated but Vk3-Jk3 and Vkl-Jk3 combinations also 
occurred. 
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On the other hand, in the clone bl group evidence 
existed for the heavy chains being more choosy about 
their light chain partner. Thus, closely related 
heavy chains appeared to be paired with related light 
5 chains. The identical heavy chain pairs (bl and bl4; 
b2 and B2) had very similar light chains (2 and 4 
amino acid differences respectively) whereas the 
distinct heavy chains (b24 and B30) had distinct light 
chains which were unrelated to one another or the 

10 other group members. The clone 4 group provides 

another example of this phenomenon in that 4 closely 
related heavy chains were paired with .3 closely 
related light chains (a predominant heavy-light chain 
combination) , except for the clone b7 light chain that 

15 was distinct. 

In summary, the heavy chain (V H ) sequences was 
organized into 7 groups where each member of a group 
has an identical or very similar CDR3 region with a 
limited number of differences elsewhere. When the 

20 light chains (V L ) were constrained into the groupings 
defined by their heavy chain partners, considerable 
light chain sequence variation was observed. This 
phenomenon of chain promiscuity has been observed 
previously and can be appreciated by reference to 

25 Figure 11. Marked neutralizing ability was confined 

to two groups of sequences. The first group consisted 
of Fabs 4, 7, 12 and 21 which have very similar heavy 
and light chains. The second group consisted of Fabs 
13, 8, 18, 22 and 27. Only Fab 13 showed marked 

30 neutralizing ability, although the others showed some 
weaker activity. Interestingly in this group Fab 13 
did have a light chain distinct from the other members 
of the group. 

35 5. Shuffling of the Heavy and Light Chain of a 
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Single Clone Against the Library 
To further explore possible functional 
heavy-light chain combinations, the heavy chain of 
clone bl2 (also referred to as Fab 12 for the 
5 corresponding soluble Fab preparation) shown in Figure 
10 was recombined with the original light chain 
library prepared in Example 2 to construct a new 
library H12-LCn. In addition, the bl2 light chain was 
recombined with the original heavy chain library to 

10 construct a library Hn-L12 . These two libraries were 
taken through 3 rounds of panning against gp!20 (IIIB) 
as described in Example 2b5) . The Fabs expressed from 
the resultant immunoreactant clones were analyzed as 
described in Example 3 above. Clone bl2 was chosen as 

15 this Fab neutralized HIV-1 in vitro as shown in 
Example 3. 

To accomplish the preparation of a shuffled 
library from the Fd gene of clone bl2 with the 
original light chain library, the bl2 heavy chain was 

20 first subcloned into a tetanus toxoid binding clone 
expressed in pComb2-3. The light chain library was 
then cloned into this construction to give a library 
of 1 x 10 7 members. The subcloning step was used to 
avoid contamination with and over-representation of 

25 the original light chain. A similar procedure was 
adopted for shuffling of heavy chains against the 
light chain from clone bl2 to give a library of 3 x 
10* members. Cloning and panning procedures were 
carried out as described above for the original 

30 library. 

Eleven light chains which recombinied with the b!2 
heavy chain and bound gp!20 by panning were randomly 
chosen for subsequent competition ELISA and sequence 
analysis. The apparent affinities of these shuffled 

35 combinations were similar with an IC^ of 
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approximately 1CT 8 to 10" 9 M. The sequences were 
organized where a set of 3 were very similar to the 
original b!2 light chain and the other 8 showing many 
differences from the original with some sub-grouping 
5 possible. 

The sequences of the light chains which bound to 
the bl2 heavy chain clone are shown in Figure 12* The 
sequences are compared to the sequence for the 
original light chain from clone bl2. The light chains 

10 are identified by numbers which do not correspond to 
the original light chain clones; the assigned numbers 
of the newly selected clones having new light chains 
are thus arbitrary. The sequences of these light 
chains are also listed in the Sequence Listing from 

15 SEQ ID NO 114 to 122. Some light chain sequences are 
identical. In addition to immunoreactivity with 
gp!20, the new Fabs isolated from these shuffled 
clones were tested in the syncytia assay for 
neutralization of HIV-l infection as described in 

20 Example 3. Four shuffled monoclonal Fab antibodies, 
each having the heavy chain from clone bl2, a known 
HIV-l neutralizing clone, and new light chains 
designated L28, L25, L26 and L22, all exhibited 
approximately 60% neutralization in a syncytia assay 

25 with 0.4 pg/ml purified Fab. This effect was 

equivalent to that obtained with the original clone 
b!2 heavy and light chain pair. Maximum 
neutralization of approximately 80% was obtained with 
the H12/L28 and H12/L25 Fabs at 0.7 fig/ml which was 

30 equivalent to that seen with the original clone bl2 
heavy and light pair. The neutralization resulting 
from the H12/L22 and H12/L26 Fabs plateaued at 60% 
with Fab concentrations of 0.4 Mg/ml up to 1.0 /xg/ml. 
Thus, in addition to the gp!20 immunoreactive and HIV 

35 neutralizing Fabs obtained in the original library 
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prepared as described in Example 2, by shuffling a 
known neutralizing heavy chain with a library of light ; : 
chains, new HIV-1 neutralizing Fab monoclonal 
antibodies have been obtained. 
5 Ten heavy chains which recombined with the b!2 

light chain were also randomly chosen. One was very 
similar to the original b!2 heavy chain but the others 
have many differences. Nevertheless, the V-D and D-J 
junctions were essentially identical indicating the 

10 clones had probably arisen from the same rearranged 
B-cell clone by somatic modification. Competition 
ELISA failed to reveal any clear difference in 
affinity between the variants selected from those 
originally analyzed. Hv vf ; 

15 The sequences of the heavy chains which bound to 

the b!2 light chain clone are shown in Figure 13. The 
sequences are compared to the sequence for the 
original heavy chain from clone b!2. The heavy chains 
are identified by numbers which do not correspond to 

20 the original light chain clones; the assigned numbers 
of the newly selected clones having new heavy chains 
are thus arbitrary. The sequences of these light 
chains are also listed in the Sequence Listing from 
SEQ ID NO 123 to 132. Some light chain sequences are ' 

25 identical. In addition to immunoreactivity with 
gp!20, the new clones were tested in the syncytia 
assay for neutralization of HIV-1 infection as 
described in Example 3. Two shuffled monoclonal Fab 
antibodies, each having the light chain from clone 

30 bl2, a known HIV-1 neutralizing clone, and new heavy 
chains designated H2 and H14, exhibited approximately 
40% neutralization in a syncytia assay with 1.0 and 
0.5 Mg/ml purified Fab, respectively. This effect was 
equivalent to that obtained with the original clone 

35 b!2 heavy and light chain pair at a concentration of 2 
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pg/ml. Maximum neutralization of approximately 50% 
was obtained with the Fab having the new H14 chain at 
1.0 fig/ml compared to 80% neutralization with 0.7 
Mg/ml with the original clone b!2 heavy and light 
5 pair. Thus, in addition to the gp!20 immunoreactive 
and HIV neutralizing Fabs obtained in the original 
library prepared as described in Example 2, by 
shuffling a known neutralizing light chain with a 
library of heavy chains, new HIV-1 neutralizing Fab 

10 monoclonal antibodies have been obtained. 

Thus, this shuffling process revealed many more 
heavy and light chain partners that bound to gpl20 
that were equal in affinity to those obtained from the 
original library prepared in Example 2. With this 

15 approach, additional HIV-1 neutralizing antibodies can 
easily be obtained over those present in an original 
library. The complexity of the clones arising from 
the heavy chain shuffling also suggests that this 
approach may be used to map the course of somatic 

2 0 diversification . 

Combinatorial libraries randomly recombine heavy 
and light chains so to what extent antibodies derived 
from such libraries represent those produced in a 
response in vivo can be determined. In principle, a 

25 heavy* light chain combination binding antigen could 

arise fortuitously, i.e., neither chain is involved in 
binding antigen in vivo but the combination does bind 
antigen in vitro . 

The available data suggests, however, that heavy 

30 chains, from immune libraries, involved in binding 

antigen tightly in vitro arise from antigen-specific 
clones in vivo . First, studies have generally failed 
to identify high-affinity binders in non- immunized IgG 
libraries. See, Persson et al. Proc. Natl. Acad. 

35 Sci, . USA , 88:2432-2436 (1991) and Marks et al. Eur. 
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J. Immunol. . 21:985-991 (1991). 

Further, as described above, gpl20 binders were 
not observed in panning a bone marrow IgG library from 
an HIV seronegative donor against gpl20. Second, 
5 heavy chains associated with binders from immunized . , 
libraries were typically at relatively high frequency 
in the library indicating they were strongly 
represented in the mRNA isolated from immunized 
animals. See, Caton et al., Proc. Natl. Acad. Sci.. 

10 USA . 87:6450-6454 (1990) and Persson et al., supra . 

Third, heavy chains from immunized libraries appeared 
to dictate specificity when recombined with various 
unrelated light chains as described in Example 6. 
Fourth, the isolation of intraclonal heavy chain 

15 variants as here indicated that an active antibody 

response was cloned. Thus, the shuffling of a known 
heavy chain with a light chain binder and vice versa 
is preferred for use in this invention as new 
neutralizing Fabs can be obtained beyond those 

20 generated in vivo . 

Heavy chain promiscuity, the ability of a 

heavy chain to pair with different light chains with 
retention of antigen affinity, presents serious 
problems for identifying in vivo light chain partners. 

25 This applies not only to the strict definition of 
partners as having arisen from the same B-cell but 
also to one which would encompass somatic variants of 
either partner. The existence of predominant 
heavy-light chain combinations, particularly involving 

30 intraclonal light chain variants, suggests that the 
light chains concerned are well represented in the 
library and probably are associated with antigen 
binding in vivo . However, promiscuity means that, 
although some combinations probably do occur in vivo . 

35 one cannot be certain that one is not shuffling immune 
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partner chains in the recombination. For instance, 
the occurrence of a virtually identical light chain 
(b6, B20) in 2 out of 33 clones suggests that it is 
probably over-represented in the library consistent 
5 with an in vivo involvement in antigen-stimulated 

clones • However, there is no way of knowing whether 
the in vivo partner of the light chain is the b6 or 
B20 heavy chain or indeed another heavy chain arising 
from a stimulated clone « 

10 The light chains arising from the combinatorial 

library may not be those employed in vivo * 
Nevertheless it is interesting to note that some heavy 
chains appear relatively choosy about light chain 
partner whereas others appear almost indifferent. 

15 This observation needs to be tempered by the finding 

that apparently choosy heavy chains from this analysis 
will accept diverse light chains with maintenance of 
antigen binding in a binary plasmid system where 
pairings are forced as shown below in Example 6 rather 

20 than selected in a competitive situation. 

Two reports compare heavy-light chain 
combinations arising from combinatorial libraries and 
hybridomas in immunized mice. The library approach 
begins with mRNA and is therefore probably reflecting 

25 plasma cell populations. In contrast, hybridomas sore 
thought to reflect activated but not terminally 
differentiated B cell populations and EBV 
transformation to reflect resting B cell populations. 
Whatever the arguments about light chain 

30 authenticity, the heavy chains of Figure 10 present 
many features of interest. The most frequently used 
heavy chain is of the clone b8 type. It could be 
argued that this usage simply represents bias in PCR 
amplification. However, the occurrence of 

35 approximately equal numbers of clones in this group 
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amplified by VHla and VH3a primers argues against this 
notion. Furthermore, the existence of intraclonal 
variants in some groups indicates that one is at least 
sampling different genes from the initial library. 
5 The antibodies cloned here do bear qualitative 

relationship with the polyclonal antibodies present in 
the serum of the asymptomatic donor. The titer of 
anti-gpl20 (IIIB) antibodies was approximately 1:3000, 
with greater than 50% of the reactivity being 
10 inhibited by CD4 or a cocktail of Fabs from clones 12, 

13 and 14. The titer of anti-gpl20 (SF2) antibodies 
was approximately 1:800. Further, the titer of serum 
against the short constrained V3 loop peptide was 
1:500 and against the full length MN V3 loop peptide 

15 was only 1:300. The importance of W anti-CD4 site 

antibodies" seems general in donors with longer term 
HIV infection in that the cocktail of Fabs 12, 13 and 

14 was able to inhibit binding of a large fraction of 
serum antibody reactivity with gp!20 (IIIB) in 26 of 

20 28 donors tested. 

The ability of Fabs to neutralize viruses has 
been a controversial area. One of the problems has 
been that Fabs are classically generated by papain 
digestion of IgG. If the Fab, as is often the case, 

25 shows reduced activity relative to the parent IgG then 
it may be difficult to rule out IgG contamination in 
the Fab preparation. Recombinant Fabs, however, as 
shown herein definitively neutralize virus. 

The mechanism of neutralization of HIV-1 appears 

30 to neither require virion aggregation nor gpl20 

cross- linking. In addition, there is no correlation 
with blocking of the CD4-gpl20 interaction to 
neutralization. The existence of the cloned 
neutralizing Fabs of this invention should allow the 

35 molecular features that confer neutralizing potential 
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to be explored. For instance , in the case of the 
group of clones containing Fab 13, the unique 
character of the light chain of that neutralizing 
clone suggests that chain shuffling experiments in 
which the 13 light chain was recombined with the other 
heavy chains in that group, might be revealing .* Heavy 
chains paired with two dissimilar light chains have 
been shown to retain antigen affinity but exhibit 
altered fine specificity as shown in Example 6. 

The observation here of a large number of Fabs 
with only a limited number being strongly neutralizing 
may have important consequences* If the pattern is 
repeated for whole antibodies then it would seem that 
much of the gpl20 structure may be in a sense a 
"decoy", i.e., the immune system may invest 
considerable effort in producing antibodies of high 
affinity but limited anti-viral function. To 
exacerbate the situation the ineffective antibodies 
may bind to gp!20 and inhibit the binding of strongly 
neutralizing antibodies. This has obvious 
consequences for vaccination which should be primarily 
designed to elicit neutralizing antibodies of this 
invention. 

6. Shuffling of Selected Heavy and Light Chain DNA 
Sequences of a Combinatorial Library in a Binary 
Plasmid System 

A binary system of repl icon-compatible plasmids 
has been developed to test the potential for 
promiscuous recombination of heavy and light chains 
within sets of human Fab fragments isolated from 
combinatorial antibody libraries. The efficiency of 
the system is demonstrated for the combinatorial 
library of this invention derived from the bone marrow 
library of an asymptomatic HIV donor. 
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a. Construction of the Binary Plasmid System 
The binary plasmids pTACOlH and pTCOl for 
use in this invention contain the pelB leader region 
and multiple cloning sites from Lambda Hc2 and Lambda 
5 Lc3, respectively, and the set of repl icon-compatible 
expression vectors pFL281 and pFL261. Both pFL281 and 
pFL261 have been described by Larimer et al., Prot, 
Ena, . 3:227-231 (1990), the disclosure of which is 
hereby incorporated by reference. The nucleotide 

10 sequences of pFL261 and pFL281 are in the EMBL, 

GenBank and DDBJ Nucleotide Sequence Databases tinder 
the accession numbers M29363 and M68946. The plasmid 
pFL281 is based on the plasmid pFL260 also described 
by Larimer et al., supra, and having the accession 

15 number M29362. The only distinction between the 

plasmids pFL260 and pF128l is that pFL281 lacks a 60 
bp sequence of pFL260 between the Bag I site and the 
Xma III site resulting in the loss of one of the two 
BamH I sites. This deletion is necessary to allow for 

20 cloning of the BamH I Hc2 fragment into the expression 
vector as described herein* 

The replicon-compatible expression vectors share 
three common elements: (i) the fl single-stranded DNA 
page intergenic IG regions; (ii) the tightly regulated 

25 tac promoter and lac operator; and (iii) an rbs-ATG 
region with specific cloning sites. The plasmid 
vectors differ in their antibiotic resistance markers 
and plasmid replicons: pFL261 carries a gene encoding 
chloramphenicol acetyltransf erase (cat) , conferring 

30 chloramphenicol resistances nd the pl5A replicon; 

pFL281 carries a gene encoding beta-lactamase (bla) , 
conferring ampicillin resistance, and the ColEl 
replicon (ori) from pMBl. The p!5A and ColEl 
replicons permit the coincident maintenance of both 

35 plasmids in the same E. coli host. 
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The Hc2 and Lc2 vectors prepared in Examples la2) 
and la3), respectively, were converted into the 
plasmid form using standard methods familiar to one of 
ordinary skill in the art and as described by Sambrook 
5 et al., Molecular Cloning: A Laboratory Manual. 2nd 

ed. r Cold Spring Harbor Laboratory Press, New York 
(1989) and subsequently digested with Xho I-Spe I 
(pHc2) and Sac I-Xba I for (pLc2) . The synthetic 
linkers for insertion into the digested pHc2 and Lc2 

10 plasmids were prepeired by American Synthesis* The 

linkers were inserted to increase the distance between 
cloning sites so as to increase the effectiveness of 
the digestions. The 5 1 and 3* linkers for preparing 
the double-stranded linker insert into pHc2 were 5* 

15 TCGA66GTCGGTCGGTCTCTAGACG6TCGGTCGGTCA 3 1 (SEQ ID NO 

133) and 5' CTAGTGACCGACCGACCGTCTAGAGACCGACCGACCC 3 1 
(SEQ ID NO 134), respectively. The 5» and 3 ■ linkers 
for preparing the double-stranded linker insert into 
pLc2 were 5 f CGGTCGGTCGGTCCTCGAGGGTCGGTCGGTCT 3 • (SEQ 

20 ID NO 135) and 5' 

CTAGAGACCGACCGACCCTCGAGGACCGACCGACCGAGCT 3 1 (SEQ ID NO 
136) , respectively. The pairs of linker 
oligonucleotides were separately ligated to their 
respective digested, calf intestinal phosphatase- 

25 treated vectors. 

Subsequently, the multiple cloning sites of pHc2 
and pLc2 were transferred into the expression vectors, 
pFL281 and pFL261, respectively. To accomplish this 
process, the multiple cloning regions of both Lc2 and 

30 Hc2 were separately amplified by PCR as described by 

Gram et al., Proc. Natl. Acad. Sci.. USA . 89:3576-3580 
(1992) and as described in Example 2b using Vent 
Polymerase (New England Biolabs) according to the 
manufacturer's recommendations. The forward primer, 

35 5 f CAAGGAGACAGGATCCATGAAATAC 3« (SEQ ID NO 137) was 
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designed to provide a flush fusion of the pelB leader 
sequence to the ribosome binding sites of the cloning 
vectors pFL261 and pFL281 via its internal BamH I site 
indicated by the underlined nucleotides. The reverse 
5 primer 5' AGGGCGAATT6GATCCCG66CCCCC 3' (SEQ ID NO 138) 

was designed to anneal downstream of the region of 
interest in the parent vector of pHc2/pLc2 and create 
a second BamH I site. The resultant Hc2 and Lc2 PCR 
amplification products were then digested with BamH I 
10t to provide for BamH I overhangs for subsequent 

ligation into BamH I linearized pFL281 and pFL261 
vectors , respectively. The resulting light chain 
vector containing the Lc2 insert, designated pTCOl, 
was used in this form, whereas the heavy chain vector 
15 was further modified with a histidine tail to allow 
purification of Fab fragments by immobilized metal 
affinity chromatography as described by Skerra et al., 
Bio /Technology , 9:273-278 (1991). For this purpose, 
the synthetic linker oligonucleotides, respectively 
20 the 5 1 and 3» linkers, 5 1 

CTAGTCATCATCATCATCATT^AGCTAGC 3* (SEQ ID NO 139) and 
5* CTAGGCTAGCTTAATGATGATGATGATGA • 3 (SEQ ID NO 140) 
was inserted into the Spe I site, in effect removing 
the decapeptide tag sequence to generate the heavy 
25 chain vector designated as pTACOlH. The expression of 
Fab fragment in all subsequent cloning experiments was 
suppressed by adding 1% (w/v) glucose to all media and 
plates. 

b. Construction of Expression Plasmids 

For expression of the light chain variable 
domain, pTCOl prepared above was first digested with 
Sac I and Xba I; individual light chain inserts were 
then obtained by separately digesting 22 of the 
pComb2-3 plasmids prepared and screened as described 



30 



35 
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in Example 2 and listed in Figure 7 that bind to gp!20 
with the same combination of enzymes and isolating the 
0.7 kb fragment using low melting point agarose gel 
electrophoresis followed by b-agarose digestion. For 
5 the chain-shuffling experiments, the following 

representative members of each of the seven groups 
shown in Figure 7 were chosen: bll; b6; b4-b!2-b7-b21; 
b3; s8; bl-b!4-b24; bl3-b22-B26-b8-bl8-b27-B8-B35-s4 ; 
and one loop peptide-binding clone, p35. The 

10 different groups are indicated by semicolon 

separations while members of the same group are 
dashed. The resultant isolated light chains were 
separately ligated into PTC01 overnight at 16 °C under 
standard conditions using a 5:1 molar insert-to-vector 

15 ratio to form 21 light chain pTCOl expression vectors. 
For expression of the heavy chain variable 
domain, pTACOlH prepared above was first digested with 
Xho I and Spe I; heavy chain inserts were then 
obtained by separately PCR amplification reactions of 

20 the 20 pComb2-3 plasmids from which light chain 

inserts were obtained. PCR wa^ used to isolate the 
heavy chain inserts instead of restriction digestion 
in order to obtain heavy chain without the cpIII gene 
anchor sequence in the vector. For the PCR reaction, 

25 the respective 5 9 and 3 9 primers, 5' 

CAGGTGCAGCTCGAGGAGTCTGGG 3 9 (VHla) (SEQ ID NO 42) and 
5 9 GCATGTACTA6TTTT6TCACAAGATTT6GG 3 9 (CGlz) (SEQ ID NO 
44) were used to amplify the region corresponding to 
the heavy chain as described in Examples 2al) and 

30 2a2). The resultant PCR products were purified by 

low-melting point electrophoresis, digested with Xho I 
and Spe I, re-purified, and separately ligated to the 
similarly prepared heavy chain pTACOlH vector using a 
1:2 molar vector-to- insert ratio to form 21 heavy 

35 chain pTACOlH expression vectors. 
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c. Co-transformation of Binary Plasmids 

CaCl 2 -competent XLl-Blue cells (Stratagene; 
recAl, endAl, gyrA96, thi, hsdRl7, sup£44, relAl, lac, 
{F 1 proAB, lacl q , ZDH15 , TnlO(tet R )}) were prepared 
5 and transformed with approximately 0.5 fig purified DNA 
of each plasmid in directed crosses of each of the 20 
light chain vectors with each of the 20 heavy chain 
vectors. The presence of both plasmids and the 
episome was selected for by plating transformants on 

10 triple-antibiotic agar plates (100 tig /ml 

carbenicillin, 30 fig/nl chloramphenicol, 10 fig /ml 
tetracycline, 32 g/1 LB agar) containing 1% glucose. 

A binary plasmid system consisting of two 
repl icon-compatible plasmids was constructed as shown 

15 in 14. The pTACOlH heavy chain vector schematic is 
shown in Figure 14A and the pTCOl light chain vector 
schematic is shown in Figure 14B. Both expression 
vectors feature similar cloning sites including pel B 
leader sequences fused to the ribosome binding sites 

20 and the tac promoters via BamH I sites as shown in 

Figure 15. The nucleotide sequences of the multiple 
cloning sites along with the tac promoter, ribosome 
binding sites (rbs) and the underlined relevant 
restriction sites for the light chain vector, pTCOl, 

25 and heavy chain vector, pTACOlH, are respectively 

shown in Figure 15A and Figure 15B. The sequences are 
also listed in the Sequence Listing as described in 
the Brief Description of the Drawings. The heavy 
chain vector pTACOlH also contains a (His) 5 -tail to 

30 allow purification of the recombinant Fab fragments by 
immobilized metal affinity chromatography. The 
presence of both plasmids in the same bacterial cell 
is selected for by the presence of both antibiotics in 
the media. Expression is partially suppressed during 

35 growth by addition of glucose and induced by the 
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addition of IPTG at room temperature. Under these 
conditions, both plasmids are stable within the cell 
and support expression of the Fab fragment as assayed . 
by ELISA using goat anti-human kappa and goat 
5 anti-human XgGl antibodies* 

d. Preparation of Recombinant Fab Fragments 
Bacterial cultures for determination of 
antigen-binding activity were grown in 96 well-tissue 

10 culture plates (Costar #3596) . 250 /il Superbroth [SB 
had the following ingredients per liter: 10 g 3-(N- 
morpholino) propanesulfonic acid, 30 g tryptone, 20 g 
yeast extract at pH 7.0 at 25 °C) containing 30 tig /ml 
chloramphenicol, 100 pg/ml carbenicillin, and l% ■ / ^ f 

15 (w/v) ] glucose were admixed per well and inoculated 

with a single double-transformant prepared in Example 
6c above. The inoculated plates were then maintained • 
with moderate shaking (200 rpm) on a horizontal shaker 
for 7-9 hours at 37°C, until the was approximately 

20 1-1.5. The cells were collected by centrifugation of 
the microtiter plate (1,500 X g for 30 minutes at 
4°C) , the supernatants were discarded, and the cells 
were resuspended and induced overnight at room 
temperature in fresh media containing 1 mH IPTG, but 

25 no glucose. Cells were harvested by centrifugation, 

resuspended in 175 pi PBS (10 mM sodium phosphate, 160 
xrtM NaCl at pH 7.4 at 25°C) containing 34 /ig/ml 
phenylmethylsulfonyl fluoride (PMSF) and 1.5% (w/v) 
streptomycin sulfate, and lysed by 3 freeze-thaw 

30 cycles between -80°C and 37°C. The resultant crude 
extracts were partially cleared by centrifugation as 
above before analysis by antigen-binding ELISA. 



35 



e. Assay and Determination of Relative 
Affinities 
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Relative affinities were determined as 
described in Example 2b6) after coating wells with 0.1 
fig of antigen. The selected antigens included tetanus 
toxoid and recombinant gpl20 (strain IIIB) and gpl20 
(strain SF2) . For each antigen, a negative control 
extract of XLl-Blue cells co-transformed with pTCOl 
and pTACOlH was tested to determine whether other 
components in E. coli had any affinity for the 
antigens in the assay. Each extract was assayed for 
BSA-binding activity and BSA-positive clones were 
considered negative. All possible single- 
transformants expressing one chain only were prepared 
as described for the double-trans formants and were 
found to have no affinity for any of the antigens 
used. Because of the nature of the assay, whether 
this was due to a lack of binding by the individual 
chains itself or due to a lack of expression or 
folding could not be determined. 

f . Results of Direct Crosses of Heavy and Light 
Chains within^ a„ Set„ of gpl20/trol60 Binding 
Antibodies 

The Fab fragments derived from the bone 
marrow of the same asymptomatic HIV donor but panned 
against gpl20 (IIIB) , gp!60 (IIIB) , and gpl20 (SF2) , 
were assigned to one of seven groups based on the 
amino acid sequences of the CDR3 of their heavy chains 
as described in Example 4. From the same library, 
antibodies to the constrained hypervariable v3 -loop- 
like peptide JSISIGPGRAFYTGZC (SEQ ID NO 141) were 
isolated. For the chain-shuffling experiments, the 
following representative members of each of the seven 
groups shown in Figure 7 were chosen: bll; b6; b4-bl2- 
b7-b21; b3; s8; bl-bl4-b24; bl3-b22-B26-b8-bl8-b27-B8- 
B35-S4; and one loop peptide-binding clone, p35. 
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Clones b4 f b7, b!2, and b21 showed neutralization 
activity against HIV when monitoring inhibition of 
infection by syncytia formation and clones b!3, bl2, 
and b4 when monitoring p24 production as shown in 
5 Example 3. Light and heavy chains were cloned from 
the original constructs and cotransformed in all 
possible binary combinations into XLl-Blue cells as 
described above. 

The results of the complete cross are shown in 

10 Figure 16. As is to be expected, identical chains 
derived from different Fab fragments had similar 
binding properties e.g., b!8HC r b27HC, B8HC, B35HC, 
s4HC. The crosses of the original heavy chains with 
the original light chains in each case clearly 

15 recapitulated binding activity. Minor differences 
existed between some heavy chains with identical 
variable domain sequences, e.g., b4 and bl2 (constant 
domains were not sequenced for any of the constructs) • 
The exception is b8HC, which was identical in its 

20 variable domain to bl8HC, b27HC, B8HC, B35HC, s4HC, 

yet shows more cross reactivity. Presumably, this is 
due to differences in expression levels in the cell or 
differences in the constant domain sequences. Clear 
differences existed between heavy chains in their 

25 tendency to accept different light chains and still 
bind antigen, but even the least promiscuous heavy 
chain in the set panned against gp!20 (IIIB) , blHC, 
still did so in 43% of its crosses. On the other side 
of the spectrum, 5 heavy chains, bllHC, b6HC, b!2HC, 

30 b7HC, and b8HC, crossed productively with all light 
chains in this set. For the heavy chain crosses 
examined in detail (all of s4HC, B35HC, B26HC; most of 
bl2HC, bl2HC) , no significant differences in apparent 
binding affinity were found between Fab fragments 

35 using the same heavy chain but different light chains 
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as shown in Figure 17 where the IC^ from competition 
with soluble gpl20 (IIIB) was approximately 10' 8 M. 

Within the original seven groups that were 
established according to the sequence of the CDR3 of 
the heavy chains and that are indicated by horizontal 
and vertical lines in Figure 16, complete promiscuity 
was present, i.e., heavy and light chains within these 
CDR3 -determined groups were completely promiscuous 
with each other. However, there was a lack of 
promiscuity between other groups, e.g., between 
blHC-b24HC and b!3LC-s4LC. In the analysis of these 
sequence-based groups, the protein antigen against 
which the phage display library was panned was not a 
critical factor. The exception to this case was the 
cross of p35HC with all light chains; the only cross 
that bound either to gpl20 (SF2 strain) or the 
original antigen, the loop peptide, was the cross 
containing the original heavy and light chains. 

Unlike the heavy chains, no light chains crossed 
productively with all heavy chains nor were any 
distinguishable from the other light chains by 
unusually low promiscuity. 

In the neutralization assays performed as 
described in Example 3, the directed cross resulting 
from the pairing of the heavy chain from clone bl2 
with the light chain from clone b2l, was effective at 
neutralizing HTV-l. 

g. Interantiaenic Crosses of Heavy and Light 
Chains 

To determine whether conclusions derived 
from the crosses between high affinity Fab fragments 
originating from the same library can be extended to 
unrelated libraries, a non-related gammalk-Fab 
fragment (P3-13) specific for tetanus toxoid from a 
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different donor was chosen for a new set of crosses 
[clone 3 in Persson et al., Proc. Natl. Acad, sci.. 
OS A , 88:2432-2436 (1991)]. Extracts were probed with 
tetanus toxoid or with gp!20 (IIIB) • The data confirm 
5 the results from the gp!20 cross experiment in that 
the binding activity towards the antigen was 
determined by the heavy chain. The heavy chain of 
clone P3-13 paired with the light chains b4, bl2, b21, 
and b!4 to yield an Fab fragment with an affinity 

10 towards tetanus toxoid; the light chain of P3-13 

paired with the heavy chains of b3, b6, bll, and bl4 
to yield an Fab fragment with an affinity towards 
gpl20 (IIIB) . None of the light chains originating 
from the gpl20 binders was able to confer gpl20 

15 specificity in combination with the P3-13 heavy chain. 

Similarly, the P3-13 light chain was unable to 
generate tetanus toxoid specificity in combination 
with any of the heavy chains originating from the 
gpl20 binders, confirming the dominance of the heavy 

20 chain in the antibody-antigen interaction. 

Interestingly, all three light chains that showed a 
strong signal against tetanus toxoid (b4, bl2, b21) 
were members of the same group when sorted by the 
CDR3's of their original heavy chains. As might be 

25 expected from crosses between unrelated libraries, not 

only was there a lower degree of promiscuity, i.e., 
chains paired productively with far fewer 
complementary chains, but the range of apparent 
affinity constants determined by competition ELISA was 

30 much broader (6.3 X 10 6 - 6.3 X 10" 8 M) . The 

replacement of the original P3-13 light chain in the 
P3-13 Fab fragment with another light chain lowered 
the affinity of the Fab towards tetanus toxoid 10 to 
100-fold (from 6.3 X 10" 8 M to 6.3 X 10* 6 M) . In the 

35 crosses of the light chain of P3-13 with all the heavy 
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chains of the HIV pannings , the productive crosses had . 
similar affinities to gpl20 (IIIB) (2.5 X 10 7 - 6.3 X 
10" 7 M), with the exception of bl4HC/P3-13LC, whose [ 
signal was too weak for a definite determination of 
5 the apparent binding constant. These affinities were 
approximately five-fold lower than those of the 
gpl20-heavy chains with their original light chains. 

Thus, the results show that chain shuffling is 
yet another maneuver allowed in vitro but not in vivo 

10 which can be expected to help extend antibody 

diversity beyond that of Nature. The overriding 
feature of the binary system of this invention is its 
ability to create large numbers (several hundred) of 
directed crosses between characterized light and heavy 

15 chains without the need for recloning individual 
chains for each cross after the initial vector 
construction. When used in combination with the 
phage-display method and biological assays , it allows 
the rapid analysis of the most interesting subset of 

20 the pool of antigen-binding clones by chain shuffling, 
with the aim of finding biologically or chemically , 
active antibodies. For the set of antigens studied 
here, most heavy chains recombined with a number of 
light chains to yield an antigen-binding Fab fragment. 

25 These results have important implications for the . 

diversity of combinatorial antibody libraries. While 
it is not possible to predict reliably the original in 
vivo combinations of light and heavy chains due to the 
surprising promiscuity of individual chains, 

30 recombinant antibody libraries take advantage of the 

fact that even distantly related Fabs against the same 
antigen can recombine in vitro to give chain 
combinations not found in vivo . In fact, after the 
identification of a certain number of antibodies that 

35 have been shown to possess some biological or chemical 
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activity , it may be better to shuffle their individual 
chains in a directed fashion than to continue sampling 
randomly from the same pool of binders. By extension, 
the promiscuity observed in this system indicates that 
5 in libraries constructed using degenerate, chemically 
synthesized oligonucleotides, there should be 
considerable flexibility in which separate synthetic 
heavy chains can pair with separate synthetic light 
chains to generate separate antigen-binding Fab 
10 fragments. The diversity of combinatorial libraries 
coupled with chain-shuffling should allow wide 
exploration of three dimensional space thereby solving 
the problem of how to approximate molecules in the 
ternary complex of antibody, substrate and cof actor. 

15 

7. Deposit of Materials 

The following cell lines have been deposited on 
September 30, 1992, with the American Type Culture 
Collection (ATCC) , 1301 Parklawn Drive, Rockville, MD, 
20 USA: 

Cell Line 

E- coli MT11 
E. coli MT12 
25 E. coli MT13 

The deposits listed above, MT11, MT12 and MT13 
are bacterial cells f E. coli ) containing the 
expression vector pComb2-3 for the respective 

30 expression of the Fabs designated bll (clone bll) , bl2 
(clone bl2), and bl3 (clone bl3) prepared in Example 
2b. The sequences of the heavy and light chain 
variable domains are listed in Figure 10 and 11, 
respectively. This deposit was made with the ATCC 

35 under the provisions of the Budapest Treaty on the 



ATCC Accession No. 
ATCC 69078 
ATCC 69079 
ATCC 69080 
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International Recognition of the Deposit of 
Microorganisms for the Purpose of Patent Procedure and 
the Regulations thereunder (Budapest Treaty) . This 
assures maintenance of a viable culture for 30 years 
5 from the date of deposit. The organisms will be made 
available by ATCC under the terms of the Budapest 
Treaty which assures permanent and unrestricted 
availability of the progeny of the culture to the 
public upon issuance of the pertinent U.S. patent or 

10 upon laying open to the public of any U.S. or foreign 
patent application, whichever comes first, and assures 
availability of the progeny to one determined by the 
U.S. Commissioner of Patents and Trademarks to be 
entitled thereto according to 35 U.S.C. §122 and the 

15 Commissioner's rules pursuant thereto (including 37 
CFR §1.14 with particular reference to 886 OG 638). 
The assignee of the present application has agreed 
that if the culture deposit should die or be lost or 
destroyed when cultivated under suitable conditions, 

20 it will be promptly replaced on notification with a 

viable specimen of the same culture. Availability of 
the deposited strain is not to be construed as a 
license to practice the invention in contravention of 
the rights granted under the authority of any 

25 government in accordance with its patent laws. 

The foregoing written specification is considered 
to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not 

30 to be limited in scope by the cell lines deposited, 

since the deposited embodiment is intended as a single 
illustration of one aspect of the invention and any 
cell lines that are functionally equivalent are within 
the scope of this invention. The deposit of material 

35 does not constitute an admission that the written 



WOW/07922 



PCT/US93/09328 



130 

description herein contained is inadequate to enable 
the practice of any aspect of the invention, including 
the best mode thereof, nor is it to be construed as 
limiting the scope of the claims to the specific 
5 illustration that it represents. Indeed, various 

modifications of the invention in. addition to those 
shown and described herein will become apparent to 
those skilled in the art from the foregoing 
description and fall within the scope of the appended 
10 claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: 

(A) NAME: The Scripps Research Institute 

(B) STREET: 10666 North Torrey Pines Road 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 92037 

(G) TELEPHONE: 619-554-2937 

(H) TELEFAX: 619-554-6312 

(ii) TITLE OF INVENTION: HUMAN NEUTRALIZING MONOCLONAL ANTIBODIES 
TO HUMAN IMMUNODEFICIENCY VIRUS 

(iii) NUMBER OF SEQUENCES: 151 < 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 1 Version #1.25 (EPO) 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US93/ 

(B) FILING DATE: 30- SEP- 1993 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/954,148 

(B) FILING DATE: 30 -SEP- 1992 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: v , 

GGCCGCAAAT TCTATTTCAA GGAGACAGTC ATAATGAAAT ACCTATT6CC TACGGCAGCC 60 
GCTGGATTGT TATTACTCGC TGCCCAACCA GCCATGGCCC AGGTGAaACT GCTCGAGATT 120 
TCTAGACTAG TTACCCGTAC GACGTTCCGG ACTACGGTTC TTAATAGAAT TCG 173 
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO ' • ! \ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TCGACGAATT CTATTAAGAA CCGTAGTCCG GAACGTCGTA CGGGTAACTA GTCTAGAAAT 60 
CTCGAGCAGT TTCACCTGGG CCATGGCTGG TTGGGCAGCG AGTAATAACA ATCCAGCGGC 120 
TGCCGTAGGC AATAGGTATT TCATTATGAC TGTCTCCTTG AAATAGAATT TGC 173 
(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear > * > ? 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TGAATTCTAA ACTAGTCGCC AAGGAGACAG TCATAATGAA ATACCTATTG CCTACGGCAG . .60 



WO 94/07922 



PCT/US93/09328 



133 

C06CTGGATT GTTATTACTC GCTGCCCAAC GAGCGATG6C CGAGCTCGTC AGTTCTAGAG 120 
TTAAGCGGCC G 131 
(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 139 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
TCGACGGCCG CTTAACTCTA GAACTGACGA GCTCGGCCAT GGCTGGTTGG GCAGCGAGTA 60 
ATAACAATCC AGCGGCTGCC GTAGGCAATA GGTATTTCAT TATGACTGTC TCCTTGGCGA 120 
CTAGTTTAGA ATTCAAGCT 139 
(2) INFORMATION FOR SEQ ID NO:5: 

{1} SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: N- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
1 5 10 15 

Ala Gin Pro Ala Met Ala Gin Val Lys Leu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: N- terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO :7: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
1 5 10 15 

Ala Gin Pro Ala Met Ala Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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TGTTGAGAAT TAATCATC6G CTCGTATAAT GTGTGGAATT GTGAGCGGAT AACAATTTCA 60 

CACAGGAGGA AGGATCCATG AAATACCTAT TGCCTACGGC AGCCGCTGGA TTGTTATTAC 120 

TCGCTCCCGA ACGAGCCATG GCCGAGCTCG GTCGGTCGGT CCTCGAGGGT CGGTCGGTCT 180 

CTAGAGTTAA GCGGCCGC 198 
.(2) INFORMATION FOR SEQ ID NO: 9: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
GCGGCCGCTT AACTCTAGAG ACCGACCGAC CCTCGAGGAC CGACCGACCG AGCTCGGCCA 60 
TGGCTGGTTG GGCAGCGAGT AATAACAATC CAGCGGCTGC CGTAGGCAAT AGGTATTTCA 120 
TGGATCCTTC CTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAT TATACGAGCC 180 
GATGATTAAT TGTCAACA 198 
(2) INFORMATION FOR. SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: N- terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys Thr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
15 10 15 
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Ala Gin Pro Ala Met Ala Glu Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TGTTGACAAT TAATCATCGG CTCGTAIAAT GTGTGGAATT GTGAGCGGAT AACAATTTCA 60 

CACAGGAGGA AGGATCCATG AAATACCTAT TGCCTACGGC AGCCGCTGGA TTGTTATTAC 120 

TCGCTGCCCA ACCAGCCATG GCCCAGGTGA AACTGCTCGA GGGTCGGTCG GTCTCTAGAC 180 

GGTCGGTCGG TCACTAGTCA TCATCATCAT CATTAAGCTA 220 
(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TAGCTTAATG ATGATGATGA TGACTAGTGA CCGACCGACC GTCTAGAGAC CGACCGACCC 60 
TCGAGCAGTT TCACCTGGGC CATGGCTGGT TGGGCAGCGA GTAATAACAA TCCAGCGGCT 120 
GCCGTAGGCA ATAGGTATTT CATGGATCCT TCCTCCTGTG TGAAATTGTT ATCCGCTCAC 180 
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AATTCCACAC ATTATACGAG CCGATGATTA ATTCTCAACA 220 
(2) INFORMATION FOR SEQ ID NO:13: 

(I) SEQUENCE CHARACTERISTICS: ' ! ! - - ; '\? 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid .'W.O. .»; -i ■ '.-A'-./'V 1 .-i'iYf 
(D) TOPOLOGY: linear A /A 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: N- terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Lys Thr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 

1 5 10 . . ,,, .15.:... 

Ala Gin Pro Ala Met Ala Gin Val Lys Leu Leu Glu v 

20 ... 25 • • 

(2) INFORMATION FOR SEQ ID N0:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide* 

(v) FRAGMENT TYPE: C- terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Thr Ser His His His His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ^ 

(D) TOPOLOGY: linear ; 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGCCGCAAAT TCTATTTCAA GGAGACAGTC AT 32 
(2) INFORMATION FOR SEQ ID N0:16: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
AATGAAATAC CTATTGCCTA CGGCAGCCGC TGGATT 36 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GTTATTACTC GCTGCCCAAC CAGCCATGGC CC 32 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CAGTTTCACC TGGGCCATGG CTGGTTGGG 29 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 
CA6C6A6TAA TAAGAATCGA 6C66CT6CCG TAG6CAATAG 40 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GTATTTCATT ATGACTGTCT CCTTGAAATA GAATTTGC 38 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: AO base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGGTGAAACT GCTCGAGATT TCTAGACTAG TTACCCGTAC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: liiiear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



AO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 
CGGAACGTCG TACGGGTAAC TAGTCTAGAA ATCTCGAG 38 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GACGTTCCGG ACTACGGTTC TTAATAGAAT TCG 33 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

r 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TCGACGAATT CTATTAAGAA CCGTAGTC 28 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TGAATTCTAA ACTAGTCGCC AAGGAGACAG TCAT 



34 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear , 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



' 1 } 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AATGAAATAC CTATTGCCTA CGGCAGCCGC TGGATT t 36 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single / 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GTTATTACTC GCTGCCCAAC CAGCCATGGC C \ ;3l 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GAGCTCGTCA GTTCTAGAGT XAAGCGGCCG 30 
.(2) INFORMATION FOR SEQ ID NO: 29: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
.(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTATTTCATT ATGACTGTCT CCTTGGCGAC TAGTTTAGAA TTCAAGCT 48 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
;. (iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CAGCGAGTAA TAACAATCCA GCGGCTGCCG TAGGCAATAG 40 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 27 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TGAC6AGCTC GGCCATGGCT GGTTGGG 27 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI* SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TCGACGGCCG CTTAACTCTA GAAC 24 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 666 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CCATTCGTTT GTGAATATCA AGGCCAAGGC CAATCGTCTG ACCTGCCTCA ACCTCCTGTC 60 

AATGCTG6CG GCGGCICTGG TGGTGGTTCT GGTGGCGGCT CTGAG6GTGG TGGCTCTGAG 120 

GGTGGCGGTT CT6A6GGTGG CGGCTCTGAG GGAGGCGGTT CCGGTGGTGG CTCTGGTTCC 180 

GGTGATTTTG ATTATGAAAA GATGGGAAAC GCTAATAAGG GGGCTATGAC GGAAAATGCC 240 

GATGAAAACG CGCTACAGTC TGACGCTAAA GGCAAACTTG ATTCTGTCGC TACTGATTAC 300 

GGTGCTGCTA TCGATGGTTT CATTGGTGAC GTTTCCGGCC TTGCTAATGG TAATGGTGCT 360 

ACTGGTGATT TTGCTGGCTC TAATTCCCAA ATGGCTCAAG TCGGTGACGG TGATAATTCA 420 

CCTTTAATGA AIAATTTCCG TCAATATTTA CCTTCCCTCC CTCAATCGGT TGAATGTCGC 480 

CCTTTTGTCT TTAGCGCTGG TAAACCATAT GAATTTTCTA TTGATTGTGA CAAAATAAAC 540 

TTATTCGGTG TCTTTGCGTT TCTTTTATAT GTTGCGACCT TTATGTATGT ATTTTCTACG 600 

TTTGCTAACA TACTGCGTAA TAAGGAGTCT TAATCATGCC AGTTCTTTTG GGTATTCCGT 660 

TATTAT 666 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid. 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Pro Phe Val Cys Glu Tyr Gin Gly Gin Gly Gin Ser Ser Asp Leu Pro 
1 5 10 15 

Gin Pro Pro Val Asn Ala Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly 
20 25 30 

Gly Ser Glu Gly Gly Gly Ser Glu Gly Gly Gly Ser Glu Gly Gly Gly 
35 40 45 
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Ser Glu Gly Gly Gly Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp 
50 55 60 

Tyr Glu Lys Met Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala 
65 70 75 80 

Asp Glu Asn Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser Val 
85 90 95 

Ala Thr Asp Tyr Gly Ala Ala lie Asp Gly Phe lie Gly Asp Val Ser 
100 105 110 

Gly Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala Gly Ser Asn 
115 120 125 

Ser Gin Met Ala Gin Val Gly Asp Gly Asp Asn Ser Pro Leu Met Asn 
130 135 140 

Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin Ser Val Glu Cys Arg 
145 150 155 160 

Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr Glu Phe Ser lie Asp Cys 
165 170 175 

Asp Lys lie Asn Leu Phe Arg Gly Val Phe Ala Phe Leu Leu Tyr Val 
180 185 190 

Ala Thr Phe Met Tyr Val Phe Ser Thr Phe Ala Asn lie Leu Arg Asn 
195 200 205 

Lys Glu Ser 
210 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GAGACGACTA GTGGTGGCGG TGGCTCTCCA TTCGTTTGTG AATATCAA 



48 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TTACTAGCTA GCATAATAAC GGAATACCCA AAAGAACTGG 40 
(2) INFORMATION FOR SEQ ID NO:37: . " ^V/^sS ^ 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
TATGCTAGCT AGTAACACGA CAGGTTTCCC GACTGG ,V V/ '.v^ ; \;.^:; V^Y\ V.Y36 
(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: ; - 1 

(A) LENGTH: 27 base pairs \\, Y\;^ % -\ Y 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear \/ : / 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
ACCGAGCTC6 AATTCGTAAT CATGGTC 27 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AGCTGTTGAA TTCGTGAAAT TGTTATCCGC T 31 
(2) INFORMATION FOR SEQ ID NO: 40: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 
GAGACGACTA GTGGTGGCGG TGGCTCTCCA TTCGTTTGTG AATATCAAGG CCAAGGCCAA 60 
TCGTCTGACC TGCCTCAACC TCCTGTCAAT GCTGGCGGCG GCTCTGGTGG TGGTTCTGGT 120 
GGCGGCTCTG AGGGTGGTGG CTCTGAGGGT GGCGGTTCTG AGGGTGGCGG CTCTGAGGGA 180 
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GGCGGTTCCG GTGGTGGCTC TGGTTCCGGT GATTTTGATT AT6AAAAGAT GGCAAACGCT 240 

AATAAGGGGG CTATGACCGA AAATGCCGAT GAAAACGCGC TACAGTCTGA CGCTAAAGGC 300 

AAACTTGATT CTGTCGCTAC TGATTACGGT GCTGCTATCG ATGGTTTCAT TGGTGACGTT 360 

TCCGGCCTTG CTAATGGTAA TGGTGCTACT GGTGATTTTG CTGGCTCXAA TTCCCAAATG 420 

GCTCAAGTCG GTGACGGTGA TAATTCACCT TTAATGAATA ATTTCCGTGA ATATTTACCT 480 

TCCCTCCCTC AATCGGTTGA ATGTCGCCCT TTTGTCTTTA GCGCTGGTAA ACCATATGAA 540 

TTTTCTATTG ATTGTGACAA AATAAACTTA TTCCGTGGTG TCTTTGCGTT TCTTTTATAT 600 

GTTGCGACCT TTATGTATGT ATTTTCTACG TTTGCTAACA TACTGCGTAA TAAGGAGTCT 660 

TAATCATGCC AGTTCTTTTG GGTATTCCGT TATTATGCTA GCTAGTAA 708 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

TATGCTAGCT AGTAACACGA CAGGTTTCCC GACTGGAAAG CGGGCAGTGA GCGGAACGCA 60 

ATTAATGTGA GTTAGCTCAC TCATTAGGCA CCCCAGGCTT TAGACTTTAT GCTTCCGGCT 120 

CGTATGTTGT GTGGAATTGT GAGCGGATAA CAATTTCACA CAGGAAACAG CTATGACCAT 180 

GATTACGAAT TCGAGCTCGG T 201 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
CAGGTGCAGC TCGAGCAGTC TGGG 24 
(2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
GAGGTGCAGC TGGAGGAGTC TGGG ?4 
(2) INFORMATION FOR SEQ ID N0:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO _ 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GCATGTACTA GTTTTGTCAC AAGATTTGGG \ 30 
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(2) INFORMATION FOR SEQ ID N0:45: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(aci) SEQUENCE DESCRIPTION : SEQ ID NO: 45: 
GACATCGAGC TCACCCAGTC TCCA 24 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:46: 
GAAATTGAGC TCACGCAGTC TCCA 24 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO; 47: flV Y ^v'/ ! ? 

GCGCC6TCTA GAACTAACAC TCTCCCCTGT TGAAGCTCTT TGTGACGGGC AAG / V , . t V ( j ; : 53 ; "/ 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:48: ' 

Ser lie Ser Gly Pro Gly Arg Ala Phe Tyr Thr Gly v; 

1 5 v> :• - 10 :- / ' . * 

(2) INFORMATION FOR SEQ ID NO:49: V 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO ;'./ 'A'; , , /;'^-: 

(iv) ANTI-SENSE: NO ^ • 

. "■ ' . ■ '-^ - . 'A^ 1 ': ; V ^ ■ V'".: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49: / • 

. •• •• • A* . W:"V V; • << • 

GTCGTTGACC AGGCAGCCCA G 21 . ; 

(2) INFORMATION FOR SEQ ID NO:50: /■'} /; : V ; \VVv)vV. fV ; i = *; 

(i) SEQUENCE CHARACTERISTICS: \ /: / -V t ^ 

(A) LENGTH: 21 base pairs . \.\ ^ 

(B) TYPE: nucleic acid 



1 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
ATAGAAGTTG TTCAGCAGGC A 21 
(2) INFORMATION FOR SEQ ID NO: 51: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
ATTAACCCTC ACTAAAG 17 
(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
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GAATTCTAAA CTAGCTAGTT CG 22 
(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Leu Glu Glu Ser Gly Thr Glu Phe Lys Pro Pro Gly Ser Ser Val Lys 
1 5 10 15 

Val Ser Cys Lys Ala Ser Gly Gly Thr Phe Gly Asp Tyr Ala Ser Asn 
20 25 30 

Tyr Ala lie Ser Trp Val Arg Gin Ala Pro Gly Gin Gly Leu Glu Tyr 
35 40 45 

lie Gly Gly lie Thr Pro Thr Ser Gly Ser Ala Asp Tyr Ala Gin Lys 
50 55 60 

Phe Gin Gly Arg Val Thr lie Ser Ala Asp Arg Phe Thr Pro lie Leu 
65 70 75 80 

Tyr Met Glu Leu Arg Ser Leu Arg lie Glu Asp Thr Ala lie Tyr Tyr 
85 90 95 

Cys Ala Arg Glu Arg Arg Glu Arg Gly Trp Asn Pro Arg Ala Leu Arg 
100 105 110 

Gly Ala Leu Asp Phe Trp Gly Gin Gly Thr Arg Val Phe Val Ser Pro 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
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Leu Glu Glu Ser Gly Ala Ala Val Gin Lys Pro Gly Ser Ser Val Arg 
1 5 10 15 

Val Ser Cys Gin Ala Ser Gly Gly Thr Phe Asp Asn Fhe Ala Ser Asn 
20 25 30 

Tyr Ala Val Ser Trp Val Arg Gin Ala Fro Gly Gin Gly Leu Glu Trp 
35 40 45 

Met Gly Gly lie Thr Pro Thr Ser Gly Thr Ala Thr Tyr Ser Gin Lys 
50 55 60 

Fhe Gin Gly Arg Val Thr He Ser Ala Ala Pro Leu Thr Pro He He 
65 70 75 80 

Tyr Met Glu Leu Arg Ser Leu Arg Asp Asp Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Glu Arg Arg Glu Arg Gly Trp Asn Pro Arg Ala Leu Val 
100 105 110 

Gly Ala Leu Asp Val Trp Gly Gin Gly Thr Thr Val 
115 120 

(2) INFORMATION FOR SEQ ID N0:55: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Leu Glu Glu Ser Gly Thr Glu Phe Lys Pro Pro Gly Ser Ser Val Lys 
1 5 10 15 

Val Ser Cys Lys Ala Ser Gly Gly Thr Phe Gly Asp Tyr Ala Ser Asn 
20 25 T 30 

Tyr Ala He Ser Trp Val Arg Gin Ala Pro Gly Gin Gly Leu Glu Tyr 
35 40 45 

He Gly Gly He Thr Pro Thr Ser Gly Ser Ala Asp Tyr Ala Gin Lys 
50 55 60 

Fhe Gin Gly Arg Val Thr He Ser Ala Asp Arg Fhe Thr Pro He Leu 
65 70 75 80 
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Tyr Met Glu Leu Arg Ser Leu Arg He Glu Asp Thr Ala He Tyr Tyr 
85 90 95 

Cys Ala Arg Glu Arg Arg Glu Arg Gly Trp Asn Pro Arg Ala Leu Arg 
100 105 HO 

Gly Ala Leu Asp Fhe Trp Gly Gin Gly Thr Arg Val Fhe Val Ser Pro 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Leu Glu Glu Ser Gly Ala Glu Val Lys Lys Pro Gly Ser Ser Val Lys 
1 5 10 15 

Val Ser Cys Lys Ala Ser Gly Gly He Fhe Ser Asp Fhe Ala Ser Asn 
20 25 30 

Tyr Ala He Ser Trp Val Arg Gin Ala Fro Gly Gin Gly Leu Glu Tyr 
35 *0 4 5 

Met Gly Gly He Thr Pro Thr Ser Gly Ser Ala Asp Tyr Ala Gin Lys 
50 55 60 

Fhe Gin Gly Arg Val Thr He Ser Ala Asp Ala Ala Thr Pro Arg Val 
65 70 75 80 

Tyr Met Glu Leu Arg He Leu Arg Ser Glu Asp Thr Ala Val Tyr Phe 
85 90 95 

Cys Ala Arg Glu Arg Arg Glu Arg Gly Trp Asn Pro Arg Ala Leu Arg 
100 105 110 

Gly Ala Leu Glu Val Trp Gly Gin Gly Thr Thr Val He Val Ser Pro 
115 120 125 



(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Leu Glu Glu Ser Gly Ala Ala Val Gin Lys Pro Gly Ser Ser Val Arg 
15 10 15 

Val Ser Cys Gin Ala Ser Gly Gly Thr Phe Asp Asn Phe Ala Ser Asn 
20 25 30 

Tyr Ala Val Ser Trp Val Arg Gin Ala Pro Gly Gin Gly Leu Glu Trp 
35 40 45 

Met Gly Gly He Thr Pro Thr Ser Gly Thr Ala Thr Tyr Ser Gin Lys 
50 55 60 

Phe Gin Gly Arg Val Thr He Ser Ala Ala Pro Leu Thr Pro He He 
65 70 75 80 

Tyr Met Glu Leu Arg Ser Leu Arg Asp Asp Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Glu Arg Arg Glu Arg Gly Trp Asn Pro Arg Ala Leu Val 
100 105 110 

Gly Ala Leu Asp Val Trp Gly Gin Gly Thr Thr Val II© Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ser Ser Val Lys 
1 5 10 15 

Val Ser Cys Lys Thr Ser Gly Gly Thr Phe Ser Asp Tyr Ala Ser Asn 
20 25 30 
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His Ala lie Ser Trp Val Arg Gin Ala Pro Gly Gin Gly Leu Glu Tyr 
35 40 45 

Met Gly Gly He Thr Pro Thr Ser Gly Thr Ala Asp Tyr Ala Gin Lys 
50 55 60 

Fhe Gin Ala Arg Val Thr He Ser Ala His Glu Phe Thr Pro He Val 
65 70 75 80 

Tyr Met Glu Leu Arg Ser Leu Arg Ser Asp Gin His Ala Thr Tyr Tyr 
85 90 95 

Cys Ala Thr Glu Arg Arg Glu Arg Gly Trp Asn Pro Arg Ala Leu Arg 
100 105 110 

Gly Ala Leu Asp He Trp Gly Gin Gly Thr Thr Val He Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 59: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<*i) SEQUENCE DESCRIPTION; SEQ ID NO: 59: 

Leu Glu Glu Ser Gly Gly Arg Leu Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Glu Gly Ser Gly Phe Thr Phe Thr Asn Ala Trp Met Thr 
20 25 30 

Trp Val Arg Gin Ser Pro Gly Lys Gly Leu Glu Trp Val Ala Ser He 
35 40 45 

Lys Ser Lys Phe Asp Gly Gly Ser Pro His Tyr Ala Ala Pro Val Glu 
50 55 60 

Gly Arg Phe Ser He Ser Arg Asn Asp Leu Glu Asp Lys Met Fhe Leu 
65 70 75 80 

Glu Met Ser Gly Leu Lys Ala Glu Asp Thr Gly Val Tyr Tyr Cys Ala 
85 90 95 



Thr Lys Tyr Pro Arg Tyr Ser Asp Met Val Thr Gly Val Arg Asn His 
100 105 110 
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Phe Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val lie Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 60: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Leu Glu Gin Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg 
15 10 15 

Leu Ser <3ys Glu Gly Ser Gly Phe Thr Phe Thr Asn Ala Trp Met Thr 
20 25 30 

Trp Val Arg Gin Ser Pro Gly Lys Gly Leu Glu Trp Val Ala Ser lie 
35 40 45 

Lys Ser Lys Phe Asp Gly Gly Ser Pro His Tyr Ala Ala Pro Val Glu 
50 55 60 

Gly Arg Phe Thr lie Ser Arg Asn Asp Leu Glu Asp Lys Leu Phe Leu 
65 70 75 80 

Glu Met Ser Gly Leu Lys Ala Glu Asp Thr Gly Val Tyr Tyr Cys Ala 
85 90 95 

Thr Lys Tyr Pro Arg Tyr Phe Asp Met Met Ala Gly Val Arg Asn His 
1O0 105 110 

Phe Tyr Met Asp Val Trp Gly Thr Gly Thr Thr Val lie Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Leu Glu Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Glu Gly Ser Gly Fhe Thr Phe Thr Asn Ala Trp Met Thr 

20 25 30 

< t 

Trp Val Arg Gin Ser Pro Gly Lys Gly Leu Glu Trp Val Ala Ser lie 
35 40 45 

Lys Ser Lys Phe Asp Gly Gly Ser Pro His Tyr Ala Ala Pro Val Glu 
50 55 60 

Gly Arg Fhe Thr lie Ser Arg Asn Asp Leu Glu Asp Lys Leu Fhe Leu 
65 70 75 80 

Glu Met Ser Gly Leu Lys Ala Glu Asp Thr Gly Val Tyr Tyr Cys Ala 
85 90 95 

Thr Lys Tyr Pro Arg Tyr Ser Asp Met Met Ala Gly Val Arg Asn His 
100 105 110 

Leu Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val lie Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Leu Glu Glu Ser Gly Gly Arg Leu Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Glu Ala Ser Gly Fhe Thr Phe Thr Asn Ser Trp Met Thr 
20 25 30 

Trp Val Arg Gin Ser Pro Gly Lys Gly Leu Glu Trp Val Ala Ser lie 
35 40 45 

Lys Arg Lys Phe Asp Gly Gly Ser Pro His Tyr Ala Ala Pro Val Glu 
50 55 60 
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Gly Arg Fhe Ser lie Ser Arg Asn Asp Leu Glu Asp Lys Met Fhe Leu 
65 70 75 80 

Glu Met Ser Gly Leu Lys Ala Glu Asp Thr Gly Val Tyr Tyr Gys Ala 
85 90 95 

Thr Lys Tyr Pro Arg Tyr Ser Asp Met Met Thr Gly Val Arg Asn His 
100 105 110 

Fhe Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val lie Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 63: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu Glu Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Glu Ser Ser Gly Fhe Thr Fhe Thr Asn Ala Trp Met Thr 
20 25 30 

Trp Val Arg Gin Ser Pro Gly Lys Gly Leu Glu Trp Val Ala Ser lie 
35 40 45 

Lys Ser Lys Fhe Asp Gly Gly Ser Pro His Tyr Ala Ala Pro Val Glu 
50 55 60 

Gly Arg Fhe Thr lie Ser Arg Asn Asp Leu Glu Asp Lys Leu Fhe Leu 
65 70 75 80 

Glu Met Ser Gly Leu Lys Ala Glu Asp Thr Gly Val Tyr Tyr Cys Ala 
85 90 95 

Thr Lys Tyr Pro Arg Tyr Ser Asp Met Met Ala Gly Val Arg Asn His 
100 105 110 

Fhe Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val lie Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 64: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Glu Glu Ser Gly Gly Arg Leu Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Glu Gly Ser Gly Phe Thr Phe Thr Asn Ala Trp Met Thr 
20 25 30 

Trp Val Arg Gin Ser Pro Gly Lys Gly Leu Glu Trp Val Ala Ser He 
35 40 45 

Lys Ser Lys Phe Asp Gly Gly Ser Pro His Tyr Ala Ala Pro Val Glu 
50 55 60 

Gly Arg Phe Ser He Ser Arg Asn Asp Leu Glu Asp Lys Met Phe Leu 
65 70 75 80 

Glu Met Ser Gly Leu Lys Ala Glu Asp Thr Gly Val Tyr Tyr Cys Ala 
85 90 95 

Thr Lys Tyr Pro Arg Tyr Ser Asp Met Met Thr Gly Val Arg Asn His 
100 105 110 

Phe Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val He Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Leu Glu Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 



( 
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Leu Ser Cys Ala Gly Ser Gly Phe Thr Phe Thr Asn Ala Trp Met Thr 
20 25 30 

Trp Val Arg Gin Ser Pro Gly Lys Gly Leu Glu Trp Val Ala Ser lie 
35 40 45 

Lys Ser Lys Phe Asp Gly Gly Ser Ser His Tyr Pro Gly Pro Val Glu 
50 55 60 

Gly Arg Phe Thr lie Ser Arg Asn Tyr lie Glu Asp Lys Leu Phe Leu 
65 70 75 80 

Glu Met Ser Gly Leu Lys Ala Glu Asp Thr Gly Val Tyr Tyr Cys Ala 
85 90 95 

Thr Lys Tyr Pro Arg Tyr Tyr Asp Met Met Arg Gly Val Arg Asn His 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val He Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala Ser Val Lys 
15 10 15 

Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn Phe Val lie His 
20 25 30 



Trp Val Arg Gin Ala Pro Gly Gin Arg Phe Glu Trp Met Gly Trp He 
35 40 45 

Asn Pro Tyr Asn Gly Asn Lys Glu Phe Ser Ala Lys Phe Gin Asp Arg 
50 55 60 

Val Thr Phe Thr Ala Asp Thr Ser Ala Asn Thr Ala Tyr Met Glu Leu 
65 70 75 80 

. Arg Ser Leu Arg Ser Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg Val 
85 90 95 



WO 94/07922 



PCT/US93/09328 



164 

Gly Pro Tyr Ser Trp Asp Asp Ser Pro Gin Asp Asn Tyr Tyr Met Asp 
100 105 110 

Val Trp Gly Lys Gly Thr Thr Val lie Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala Ser Val Lys 
15 10 15 

Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn Phe Val He His 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Gin Arg Phe Glu Trp Met Gly Trp He 
35 40 45 

Asn Pro Tyr Asn Gly Asn Lys Glu Phe Ser Ala Lys Phe Gin Asp Arg 
50 55 60 

Val Thr Phe Thr Ala Asp Thr Asp Ala Asn Thr Ala Tyr Met Glu Leu 
65 70 75 80 

Arg Ser Leu Arg Ser Ala Asp Thr Ala He Tyr Tyr Cys Ala Arg Val 
85 90 95 

Gly Pro Tyr Thr Trp Asp Asp Ser Pro Gin Asp Asn Tyr Tyr Met Asp 
100 105 110 

Val Trp Gly Lys Gly Thr Lys Val He Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Leu GIu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala Ser Val Lys 
1 5 10 15 

Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn Phe Val lie His 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Gin Arg Phe Glu Trp Met Gly Trp lie 
35 40 45 

Asn Pro Tyr Asn Gly Asn Lys Glu Phe Ser Ala Lys Phe Gin Asp Arg 
50 55 60 

Val Thr Phe Thr Ala Asp Thr Asp Ala Asn Thr Ala Tyr Met Glu Leu 
65 70 75 80 

Arg Ser Leu Arg Ser Thr Asp Thr Ala lie Tyr Tyr Cys Ala Arg Val 
85 90 ; 95 

Gly Pro Tyr Thr Trp Asp Asp Ser Pro Gin Asp Asn Tyr Tyr Met Asp 
100 105 110 

Val Trp Gly Lys Gly Thr Lys Val lie Val Ser Ser 
115 120 V 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids ^ / 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:69: 

Leu Glu Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Val Gly Ser Gly Phe Thr Phe Ser Ser Ala Trp Met Ala 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Arg Gly Leu Glu Trp Val Gly Leu He 
35 40 45 

Lys Ser Lys Ala Asp Gly Glu Thr Thr Asp Tyr Ala Thr Pro Val Lys 
50 55 ; 60 
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Gly Arg Phe Ser lie Ser Arg Asn Asn Leu Glu Asp Thr Val Tyr Leu 
65 70 75 80 

Gin Met Asp Ser Leu Arg Ala Asp Asp Thr Ala Val Tyr Tyr Cys Ala 
85 90 95 

Thr Gin Lys Pro Arg Tyr Phe Asp Leu Leu Ser Gly Gin Tyr Arg Arg 
100 105 110 

Val Ala Gly Ala Phe Asp Val Trp Gly His Gly Thr Thr Val Thr Val 
115 120 125 

Ser Pro 
130 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Leu Glu Glu Ser Gly Gly Gly Leu Val Lys Ala Gly Gly Ser Leu Arg 
15 10 15 

Leu Ser Cys Val Gly Ser Gly Phe Thr Phe Ser Ser Ala Trp Met Ala 
20 25 30 

Trp Val Gly Gin Ala Pro Gly Arg Gly Leu Glu Trp Val Gly Leu He 
35 40 45 

Lys Ser Lys Ala Asp Gly Glu Thr Thr Asp Tyr Ala Thr Pro Val Lys 
50 55 60 

Gly Arg Phe Ser He Ser Arg Asn Asn Leu Glu Asp Thr Val Tyr Leu 
65 70 75 80 

Gin Met Asp Ser Leu Arg Ala Asp Asp Thr Ala Val Tyr Tyr Cys Ala 
85 90 95 

Thr Gin Lys Pro Arg Tyr Phe Asp Leu Leu Ser Gly Gin Tyr Arg Arg . 
100 105 110 

Val Ala Gly Ala Phe Asp Val Trp Gly His Gly Thr Thr Val Thr Val 
115 120 125 
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Ser Pro 
130 

(2) INFORMATION FOR SEQ ID NO:71: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 130 amino, acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Leu Glu Glu Ser Gly Gly Gly Leu lie Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Val Gly Ser Gly Phe Thr Phe Ser Ser Ala Trp Met Thr 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie Gly Leu lie 
35 40 45 

Lys Ser Lys Ala Asp Gly Glu Thr Thr Asp Tyr Ala Thr Pro Val Lys 
50 55 60 

Gly Arg Phe Thr lie Ser Arg Asn Asn Leu Glu Asn Thr Val Tyr Leu 
65 70 75 80 

Gin Met Asp Ser Leu Arg Ala Asp Asp Thr Ala Val Tyr Tyr Cys Ala 
85 90 95 

Thr Gin Lys Pro Ser Tyr Tyr Asn Leu Leu Ser Gly Gin Tyr Arg Arg 
100 105 110 

Val Ala Gly Ala Phe Asp Val Trp Gly His Gly Thr Thr Val Thr Val 
115 120 125 

Ser Pro 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 72: 

Leu Glu Glu Ser Gly Glu Ala Val Val Gin Pro Gly Arg Ser Leu Arg 

1 5 10 J 15 • ( 

Leu Ser Cys Ala Ala Ser Gly Phe lie Phe Arg Asn Tyr Ala Met His 

20 25 ; 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Leu lie 
35 40 45 

Lys Tyr Asp Gly Arg Asn Lys Tyr Tyr Ala Asp Ser Val Lys Gly Arg ' 
50 55 60 

Phe Thr lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr Leu Gin Met 
65 70 75 80 

Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp . 

85 • • 90 ' ' • 95 •■•-jtt 

\\ ■ • ,* 

lie Gly Leu Lys Gly Glu His Tyr Asp lie Leu Thr Ala Tyr Gly Pro , 
100 105 110 

Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
115 120 125 

(2) INFORMATION FOR SEQ ID N0:73: 

(i) SEQUENCE CHARACTERISTICS : 

(k) LENGTH: 125 amino acids 
(£) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:73: > ^ ; 

Leu Glu Gin Ser Gly Glu Ala Val Val Gin Pro Gly Thr Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Arg Asn Tyr Ala Met His 

20 25 -v 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Leu lie \ 

35 40 < 45 • 

Lys Tyr Asp Gly Arg Asn Lys Tyr Tyr Ala Asp Ser Val Lys Gly Arg 

50 55 60 ' A 
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Phe Ser lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr Leu Glu Met 
65 70 75 80 

Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp 
85 90 95 

lie Gly Leu Lys Gly Glu His Tyr Asp lie Leu Thr Ala Tyr Gly Pro 
100 105 110 

Asp Tyr Trp Gly Gin Gly Ala Leu Val Thr Val Ser Ser 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Leu Glu Gin Ser Gly Glu Ala Val Val Gin Pro Gly Arg Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Ala Ala Ser Gly Phe He Phe Arg Asn Tyr Ala Met His 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Leu He 
35 40 45 

o 

Lys Tyr Asp Gly Arg Asn Lys Tyr Tyr Ala Asp Ser Val Lys Gly Arg 
50 55 60 

Phe Thr He Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr Leu Gin Met 
65 70 75 80 

Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp 
85 90 95 

He Gly Leu Lys Gly Glu His Tyr Asp He Leu Thr Ala Tyr Gly Pro 
100 105 110 

Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Leu Glu Glu Ser Gly Glu Ala Val Val Gin Pro Gly Thr Ser Leu Arg 
15 10 15 

Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Arg Asn Tyr Ala Met His 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Leu lie 
35 40 45 

Lys Tyr Asp Gly Arg Asn Lys Tyr Tyr Ala Asp Ser Val Lys Gly Arg 
50 55 60 

Phe Ser lie Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr Leu Glu Met 
65 70 75 80 

Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp 
85 90 95 

lie Gly Leu Lys Gly Glu His Tyr Asp lie Leu Thr Ala Tyr Gly Pro 
100 105 110 

Asp Tyr Trp Gly Gin Gly Ala Leu Val Thr Val Ser Ser 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Leu Glu Gin Ser Gly Glu Ala Val Val Gin Pro Gly Arg Ser Leu Arg 
15 10 15 

Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Arg Asn Tyr Ala Met His 
20 25 30 
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Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Leu He 
35 40 45 

Lys Tyr Asp Gly Arg Asn Lys Tyr Tyr Ala Asp Ser Val Lys Gly Arg 
50 55 60 

Fhe Thr He Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr Leu Gin Met 
65 70 75 80 

Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp 
85 90 95 

He Gly Leu Lys Ala Glu His Tyr Asp He Leu Thr Ala Tyr Gly Pro 
100 105 110 

Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
115 .120 125 

(2) INFORMATION FOR SEQ ID NO:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Leu Glu Gin Ser Gly Glu Ala Vai Val Gin Pro Gly Arg Ser Leu Arg 
15 10 15 

Leu Ser Cys Ala Ala Ser Gly Fhe He Fhe Arg Asn Tyr Ala Met His 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Leu He 
35 40 45 

Lys Tyr Asp Gly Arg Asn Lys Tyr Tyr Ala Asp Ser Val Lys Gly Arg 
50 55 60 

Fhe Thr He Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr Leu Gin Met 
65 70 75 80 

Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp 
85 90 95 

He Gly Leu Lys Gly Glu His Tyr Asp He Leu Thr Ala Tyr Gly Fro 
100 105 110 
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Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:78: 

Leu Glu Gin Ser Gly Gly Gly Val Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Glu Gly Ser Gly Phe Thr Phe Pro Asn Ala Trp Met Thr 
20 25 30 

Trp Val Arg Gin Ser Pro Gly Lys Gly Leu Glu Trp Val Ala Ser lie 
35 40 45 

Lys Ser Lys Phe Asp Gly Gly Ser" Pro His Tyr Ala Ala Pro Val Glu 
50 55 60 

Gly Arg Phe Thr He Ser Arg Asn Asp Leu Glu Asp Lys Val Phe Leu 
65 70 75 80 

Gin Met Asn Gly Leu Lys Ala Glu Asp Thr Gly Val Tyr Tyr Cys Ala 
85 90 95 

Thr Arg Tyr Pro Arg Tyr Ser Glu Met Met Gly Gly Val Arg Lys His 
100 105 110 

Phe Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val Ser Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: . ! : \ .. 



Leu Glu Glu 
1 


Ser 


Gly 
5 


Gly 


Gly Val 

■ : ,., ! .* 


Val Lys Pro Gly Gly Ser Leu Arg 

10 15 ;-Vi>' , v.-./.--...' 


Leu Ser Cys 


Glu 
20 


Gly 


Ser 


Gly Phe 


Thr Phe Pro Asn Ala Trp Met Thr (V \ / 

25.;;; ',30 . ^ /v^v^'A^^- 


Trp Val Arg 
35 


Gin 


Ser 


Pro 


Gly Lys 
40 


Gly Leu Glu Trp Val Ala Ser lie 
45 


Lys Ser Lys 
50 


Phe 


Asp 


Gly 


Gly Ser 
55 


Pro His Tyr Ala Ala Pro Val Glu 

60 ■ ■ 


Gly Arg Phe 
65 


Thr 


lie 


Ser 
70 


Arg Asn 


Asp Leu Glu Asp Lys Val Phe Leu 
75 80 


Gin Met Asn 


Gly 


Leu 
85 


Lys 


Ala Glu 


Asp Thr Gly Val Tyr Tyr Cys Ala 

90 ■ H .. , ; , ; 95 ;•. :,.;r- , 
■ ; ." ; iyVi' : ; ; 4i- '■.••••' ,v t '"'-V^^^ 


Thr Arg Tyr 


Pro 
100 


Arg 


Tyr 


Ser Glu 


Met Met Gly Gly Val Arg Lys His' ^ 1 - ( 

105 , ■ . 110 :,. v \ ^v, -^v <; 



Phe Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val Ser Val Ser Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Leu Glu Glu Ser Gly Gly Gly Leu Val Gin Fro Gly Arg Ser Leu Arg 
1 5 10 15 

Val Ser Cys Glu Ala Ser Gly Phe Thr Phe Ser Ser Tyr Glu Met Asn 
20 25 30 . \ 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ser Gin lie 
35 40 ' 45 



Ser Ser Ser Gly Ser Arg Thr Tyr Tyr Ala Asp Ser Val Lys Gly Arg 
50 * 55 60 
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Phe Thr lie Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr Leu Glu Met 
65 70 75 80 

Thr Ser Leu Arg Val Asp Asp Thr Ala Val Tyr Tyr Cys Ala Arg Gly 
85 90 95 

Arg Arg Leu Val Thr Phe Gly Gly Val Val Ser Gly Gly Asn He Trp 
100 105 110 

Gly Gin Gly Thr Met Val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino adds 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Leu Glu Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg Ser Leu Arg 
15 10 15 

Leu Ser Cys Ala Gly Ser Gly Phe Asn Phe Ser Asp Asp Thr Met His 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Val He 
35 40 45 

Ser Tyr Glu Gly Ser Asp Lys Tyr Tyr Ala Asp Ser Val Lys Gly Arg 
50 55 60 

Phe Thr He Ser Arg Asp Asn Ser Glu Asn Thr Leu Tyr Leu Gin Met 
65 70 75 80 

Asp Ser Leu Arg Ala Asp Asp Thr Ala Leu Tyr Tyr Cys Ala Arg Asn 
85 90 95 

Thr Arg Glu Asn He Glu Ala Asp Gly Thr Ala Tyr Tyr Ser Tyr Tyr 
100 105 110 

Met Asp Val Trp Gly Lys Gly Thr Thr Val Thr Val Ser Ser 
115 120 125 

(2) INFORMATION FOR SEQ ID NO:82: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
1 5 10 15 

Val Thr lie Thr Cys Arg Ala Ser Gin Gly lie Ser Asn Tyr Leu Ala 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Lys Val Pro Arg Leu Leu He Tyr Ala 
35 40 45 

Ala Ser Thr Leu Gin Pro Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Val Ala thr Tyr Tyr Cys Gin Lys Tyr Asn Ser Ala Pro Arg Thr Phe 
85 90 95 

Gly Gin Gly Thr Lys Val Glu He Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID N0:83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser He Gly Asp Arg 
15 10 15 

Val Thr He Thr Cys Arg Ala Ser Gin Gly He Asn Asn Tyr Leu Ala 
20 25 30 

Trp Tyr Gin Gin Arg Pro Gly Lys Val Pro Arg Leu Leu He Tyr Ala 
35 40 45 
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Ala Ser Thr Leu Gin Ser Gly Val Pro Thr Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Val Ala Thr Tyr Tyr Cys Gin Lys Tyr Asn Ser Val Pro Arg Thr Phe 
85 90 95 

Gly Gly Gly Thr Lys Val Glu lie Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 84: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid. 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
1 5 10 15 

Val Thr lie Thr Cys Arg Ala Ser Gin Gly lie Ser Asn Tyr Leu Ala 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Lys Val Pro Lys Leu Leu lie Tyr Ala 
35 40 45 

Ala Ser Thr Leu Gin Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Val Ala Thr Tyr Tyr Cys Gin Lys Tyr Asn Ser Ala Pro Arg Thr Phe 
85 90 95 

Gly Gin Gly Thr Lys Val Glu lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



WOW/07922 



PCT/US93/09328 



177 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser lie Gly Asp Arg 
1 5 10 15 

Val Thr He Thr Cys Arg Ala Ser Gin Gly He Asn Asn Tyr Leu Ala 
20 25 30 

Trp Tyr Gin Gin Arg Pro Gly Lys Ala Pro Asn Leu Leu He Tyr Ala 
35 40 45 

Ala Ser Thr Leu Gin Ser Gly Val Pro Pro Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Val Ala Thr Tyr Tyr Cys Gin Lys Tyr Asn Ser Val Pro His Thr Phe 
85 90 95 

Gly Gly Gly Thr Lys Val Glu He Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
1 5 10 15 

Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser Val He Ser Asn Tyr Leu 
20 25 30 

Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu He Tyr 
35 40 45 

Gly Val Ser Asn Arg Ala Thr Gly He Pro Asp Arg Phe Ser Gly Ser 
50 55 60 
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Gly Ser Gly Thr Asp Fhe Thr Leu Thr lie Ser Arg Leu Glu Pro Glu , 

65 70 • • v..- 75'v: ■• •• :^y> ■ ■■ bo 

'• \ * 

Asp Phe Ala Val Tyr Ser Cys Gin Gin Tyr Gly Thr Ser Pro Trp Thr 

85 - -90 ' ..!•■ -95 • \X 

Phe Gly Gin Gly Thr Lys Val Glu He Lys Arg Thr , 

ioo ' ■ \ '■: io5-- t - },,; "\. ' " 1 ^ 

(2) INFORMATION FOR SEQ ID NO:87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 
(£) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: V 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
1 5 10 15 

Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser Val Ser Asn Asn Tyr Leu 
20 25 30 

Ala Trp Tyr Gin Gin Arg Pro Gly Gin Ala Pro Arg Leu Leu lie Tyr 
35 40 45 

Gly Ala Ser Asn Arg Ala Thr Gly lie Pro Asp Arg Fhe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Ala Fhe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu ■ 
65 70 75 t 80 : 

Asp Val Ala lie Tyr Tyr Cys Gin Gin Tyr His Ser Ser Pro Tyr Thr V 

V 85 : ■ 90 ' ' '/:>-. ■. 95 

Phe Gly Gin Gly Thr Lys Leu Glu He Lys Arg 

100 105 ; , ; 

(2) INFORMATION FOR SEQ ID NO:88: s '/ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid .. V . \ V'...'.' '' '':!' :l ■ /*'. ; ' 
(D) TOPOLOGY: linear '\l : \' f 

(ii) MOLECULE TYPE: protein * : : 



WO 94/07522 



PCT/US93/09328 

179 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
15 10 15 

Ala Thr Leu Ser Cys Arg Ala Ser His Arg Val Asn Asn Asn Phe Leu 
20 25 30 

Ala Trp Tyr Gin Gin Lys Pro Gin Ala Pro Arg Leu Leu lie Ser Gly 
35 40 45 

Ala Ser Thr Arg Ala Thr Gly He Pro Asp Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Thr He Ser Arg Leu Glu Pro Asp Asp 
65 70 75 80 

Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Gly Asp Ser Pro Leu Tyr Ser 
■ 85 90 95 

Phe Gly Gin Gly Thr Lys Leu Glu He Lys Arg Thr 
100 105 

t (2) INFORMATION FOR SEQ ID »0:89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Glu Leu Thr Gin Ser Pro Ala Ser Val Ser Ala Ser Val Gly Asp Thr 
15 10 15 

Val Thr lie Thr Cys Arg Ala Ser Gin Asp He His Asn Trp Leu Ala 
20 25 30 

Trp Tyr Gin Gin Gin Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Ala 
35 40 45 

Ala Ser Ser Leu Gin Ser Gly Val Pro Ser Arg Phe Ser Gly Arg Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 
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Fhe Ala Thr Tyr Tyr Cys Gin Gin Gly Asn Ser Phe Pro Lys Phe Gly 
85 90 ' 95 

Pro Gly Thr Val Val Asp He Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 90: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
1 5 10 15 

Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser Leu Ser Asn Asn Tyr Leu 
20 25 30 

Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu He Tyr 
35 40 45 

Gly Ser Ser Thr Arg Gly Thr Gly He Pro Asp Arg Phe Ser Gly Gly 
50 55 , 60 

Gly Ser Gly Thr Asp Phe Thr Leu Thr He Ser Arg Leu Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Val Tyr Tyr Cys Gin His Tyr Gly Asn Ser Val Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Glu He Lys Arg ' 
100 105 

(2) INFORMATION FOR SEQ ID N0:91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
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Gin Ser Pro Asp Thr Leu Ser Leu Asn Pro Gly Glu Arg Ala Thr Leu 
1 5 10 15 

Ser Cys Arg Ala Ser His Arg lie Ser Ser Lys Arg Leu Ala Trp Tyr 
20 25 30 

Gin His Lys Arg Gly Gin Ala Pro Arg Leu Leu lie Tyr Val Cys Pro 
35 40 45 

Asn Arg Ala Gly Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly 
50 55 . 60 

Thr Asp Phe Thr Leu Thr Tyr Ser Arg Leu Glu Pro Glu Asp Phe Ala 
65 70 75 80 

Met Tyr Tyr Cys. Gin Tyr Tyr Gly Gly Ser Ser Tyr Thr Phe Gly Gin 
85 90 95 

Gly Thr Lys Val Glu lie Thr Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Gin Ser Pro Ser His Leu Ser Leu Ser Pro Gly Glu Arg Ala He Leu 
15 10 15 

Ser Cys Arg Ala Ser Gin Arg Val Ser Ala Pro Tyr Leu Ala Trp Tyr 
20 25 30 

Gin Gin Arg Pro Gly Gin Ala Pro Arg Leu Val He Tyr Gly Ala Ser 
35 40 45 

Thr Arg Ala Thr Asp He Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly 
50 55 60 

Thr Asp Phe Thr Leu Thr He Ser Arg Leu Glu Pro Glu Asp Phe Ala 
65 70 75 80. - 

He Tyr Tyr Cys Gin Val Tyr Gly Gin Ser Pro Val Leu Phe Gly Gin 
85 90 95 
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Gly Thr Lys Leu Glu Met Lys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Asp Arg Ala Thr Leu 
15 10 15 

Ser Cys Arg Ala Ser Gin Ser Leu Ser Ser Ser Phe Leu Ala Trp Tyr 
20 25 30 

Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu lie Tyr Ser Ala Ser 
35 40 45 

Met Arg Ala Thr Gly lie Pro Asp Arg Phe Arg Gly Ser Val Ser Gly 
50 55 60 

Thr Asp Phe Thr Leu Thr lie Thr Arg Leu Glu Pro Glu Asp Phe Ala 
65 70 75 80 

Val Tyr Tyr Cys Gin Arg Phe Gly Thr Ser Pro Leu Tyr Thr Phe Gly 
85 90 95 

Gin Gly Thr Lys Leu Glu Met Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID N0:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg Ala Thr Leu 
15 10 15 
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Ser Cys Arg Ala Ser Gin Ser Phe Ser Ser Asn Phe Leu Ala Trp Tyr /o 

20 25 30 '-V: 

Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu lie Tyr Val His Pro 

35 - 40'- ' 45 ' V V: " 'y.V 

Asn Arg Ala Thr Gly Val Pro Asp Arg Phe Ser Gly Ser, Gly Ser Gly y';& 

50 55 60 : I 

Thr Asp Phe Thr Leu Thr lie Arg Arg Leu Glu Pro Glu Asp Phe Ala 
65 70 75 80 

Val Tyr Tyr Cys Gin Gin Tyr Gly Ala Ser Leu Val Ser Phe Gly Pro 
85 90 95 

Gly Thr Lys Val His He Lys Arg 
100 

* * • ■ ■ . - j 

(2) INFORMATION FOR SEQ ID NO: 95: ; /^i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Glu Leu Thr Gin 
1 

Ala Thr Phe Ser 
20 

Ala Trp Tyr Gin 
35 

Gly Val Ser Asn 
50 

Gly Ser Gly Thr 
65 

Asp Phe Ala Leu 



Phe Gly Gin Gly 
100 



Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
5 10 15 

Cys Arg Ser Ser His Ser He Arg Ser Arg Arg Val 
25 30 

His Lys Pro Gly Gin Ala Pro Arg Leu Val lie His 

40 V- - - . 45' l v "./- ;■• *;^;t 

Arg Ala Ser Gly He Ser Asp Arg Phe Ser Gly Ser 
• 55 60 

Asp Phe Thr Leu Thr He Thr Arg Val Glu Pro Glu 

70 75 : :; 80 

Tyr Tyr Cys Gin Val Tyr Gly Ala Ser Ser Tyr Thr 

85 -90 '• V l {; v - ' 95 

Thr Lys Leu Glu Arg Lys Arg Thr ! 

■' 105 • • ; ■ ' ' 
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(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Thr Pro Gly Glu Arg 
15 10 15 

Ala Thr Leu Ser Cys Arg Thr Ser His Ser lie Arg Ser Arg Arg Leu 
20 25 30 

Ala Trp Tyr Gin Val Lys Gly Gly Gin Ala Pro Arg Leu Leu lie Tyr 
35 40 45 

Gly Val Ser Asn Arg Ala Gly Gly lie Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Arg Leu Glu Pro Glu 
65 70 75 8 ° 

Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Gly Ser Ser Arg Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Glu lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Glu Leu Thr Gin Ala Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
15 10 15 

Ala Thr Phe Ser Cys Arg Ser Ser His Ser lie Arg Ser Arg Arg Val 
20 25 30 
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Arg Trp Tyr Gin His Lys Pro Gly Gin Ala Pro Arg Leu Val II* His 
35 40 45 

Gly Val Ser Asn Arg Ala Ser Gly lie Ser Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Phe Xhr Leu Thr lie Thr Arg Val Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Leu Tyr Tyr Cys Gin Val Tyr Gly Ala Ser Ser Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Glu Arg Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Glu Leu Thr Gin Ala Pro Gly Thr Leu Ser Leu Ser Pro Gly Asp Arg 
1 5 1° 15 

Ala Thr Phe Ser Cys Arg Ser Ser His Asn lie Arg Ser Arg Arg Val 
20 25 30 

Ala Trp Tyr Gin His Lys Pro Gly Gin Ala Pro Arg Leu Val lie His 
35 40 45 

Gly Val Ser Asn Arg Ala Ser Gly lie Ser Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Thr Arg Leu Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Leu Tyr Tyr Cys Gin Val Tyr Gly Ala Ser Ser Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Asp Phe Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 



v 
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(A) LENGTH: 108 amino adds 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
15 10 15 

Ala Thr Leu Ser Cys Arg Ala Gly Gin Ser lie Ser Ser Asn Tyr Leu 
20 25 30 

Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu lie Tyr 
35 40 45 

Gly Ala Ser Asn Arg Ala Thr Gly lie Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Fhe Thr Leu Ser lie Ser Arg Leu Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Gly Thr Ser Pro Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Gin Leu Asp lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg Ala Thr Leu 
1 5 10 15 

Ser Cys Arg Ala Ser Gin Ser Leu Ser Asn Asn Tyr Leu Ala Trp Tyr 
20 25 30 

Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu lie Tyr Gly Ser Ser 
35 40 45 
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Thr Arg Ala Thr Gly lie Pro Asp Arg Phe Ser Gly Gly Gly Ser Gly 
50 55 60 

Thr Asp Phe Thr Leu Thr lie Ser Arg Leu Glu Pro Glu Asp Phe Ala 
65 70 75 80 

Val Tyr Tyr Cys Gin Gin Tyr Gly Asn Ser Val Tyr Thr Phe Gly Gin 
85 90 95 

Gly Thr Lys Leu Glu lie Lys Arg 
100 

(2) INFORMATION FOR SEQ ID NO: 101: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
1 5 .10 15 

Val Thr lie Thr Cys Arg Thr Ser Gin Gly lie Ser Asn Tyr Leu Ala 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Lys Val Pro Lys Leu Leu lie Tyr Gly 
35 40 45 

Ala Ser Thr Leu Gin Ser Gly Gly Pro Ser Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Thr lie Ash Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Val Ala Thr Tyr Ser Cys Gin Asn Tyr Asp Ser Ala Pro Trp Thr Phe 
85 90 95 

Gly Gin Gly Thr Lys Val Asp lie Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
1 5 10 15 

Val Thr lie Thr Cys Arg Ala Ser Gin Ser lie Ser Asn Tyr Leu Asn 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Ala 
35 40 45 

Ala Ser Ser Leu Gin Arg Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Ser lie Ser Ser Leu Gin Pro Glu Asp V: 

65 70 ' ■ 75 ■ ^ I " V' . ''- 80 ; /'C^j 

Phe Ala Thr Tyr Tyr Cys Gin Gin Ser Tyr Ser He Pro Pro Leu Thr ; f/ 

85 90 95 - v : 

Phe Gly Gly Gly Thr Lys Val Glu lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 103: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein /V 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
1 5 10 15 

Val Thr He Thr Cys Arg Ala Ser Gin Asn lie Asn Asn Tyr Leu Asn 

20 25 - '< 30 

Trp Tyr Gin Gin Lys Pro Gly Glu Ala Pro Lys Leu Leu lie His Thr 

35 40 45 ; : 

Ala Phe Asn Leu Gin Ser Gly Val Pro Ser Arg Phe Ser Gly Thr Ala 

50 55 > ^ 60 



' i i 1 • 



. i . 
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Ser Gly Thr Glu Phe Thr Leu Thr lie Arg Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Phe Ala Thr Tyr Tyr Cys Gin Gin Ser Tyr Ser Thr Pro Tyr Thr Phe 
85 90 95 

Gly Gin Gly Thr Lys Val Glu lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Glu Leu Thr Gin Ser Pro Ser Ser 
1 5 

Val Thr lie Thr Cys Arg Ala Ser 
20 

Trp Tyr Gin Gin Lys Pro Gly Lys 
,35 40 

Ala Ser Ser Leu Gin Ser Gly Val 
50 55 

Ser Gly Thr Asp Phe Thr Leu Thr 
65 70 

Phe Ala Thr Tyr Tyr Cys Gin Gin 
85 

Gly Gin Gly Thr Lys Leu Glu lie 
100 

(2) INFORMATION FOR SEQ ID N0:105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



Leu Ser Ala Ser Val Gly Asp Arg 
10 15 

Gin Ser lie Ser Ser Tyr Leu Asn 
25 30 

Ala Pro Lys Leu Leu lie Tyr Ala 
45 



Pro Ser Arg Phe Ser Gly Ser Gly 
60 

lie Ser Ser Leu Gin Pro Glu Asp 
75 80 

Ser Tyr Ser Thr Pro Tyr Thr Phe 
90 95 

Lys Arg Thr 
105 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
15 10 15 

Val Thr lie Thr Cys Arg Ala Ser Gin Ser lie Ser Ser Tyr Leu Asn 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr Ala 
35 40 45 

Ala Ser Ser Leu Gin Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Phe Ala Thr Tyr Tyr Cys Gin Gin Ser Tyr Ser Thr Pro Gin Thr Phe 
85 90 95 

Gly Gin Gly Thr Lys Leu Glu lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 106: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie 
1 5 10 15 

Thr Cys Arg Ala Ser Gin Thr lie Ser Ser Tyr Leu Asn Trp Tyr Gin 
20 25 30 

Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr Ala Ala Ser Ser 
35 40 45 

Leu Gin Ser Gly Val Pro Ser Arg Phe Ser Gly Gly Gly Ser Gly Thr 
50 55 60 

Asp Phe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp Fhe Ala Thr 
65 70 75 80 
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Tyr Tyr Cys Gin Gin Ser Tyr Ser Thr Pro Tyr Thr Phe Gly Gin Gly 
85 90 95 

Thr Lys Leu Glu lie Lys Arg Thr 
100 

(2) INFORMATION FOR SEQ ID N0:107: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
1 5 10 15 

Val Thr lie Thr Cys Gin Ala Ser Gin Asp lie Arg Asn Tyr Leu Asn 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr Asp 
35 40 45 

Ala Ser Asn Ser Glu Thr Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 
50 55 60 

■ \ V . 

Ser Gly Arg Asp Phe Thr Phe Thr lie Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Val Ala Thr Tyr Tyr Cys Gin Gin His Gin Asn Val Pro Leu Thr Phe 
85 90 95 

Gly Gly Gly Thr Lys Val Glu lie Lys Arg Thr , 
100 105 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
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Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
1 5 10 15 

Val Thr lie Thr Cys Gin Ala Ser Gin Asp lie Ser Asn His Leu Asn 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr Asp 
35 40 45 

Ala Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Phe Thr Phe Thr lie Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 

lie Ala Thr Tyr Tyr Cys Gin Gin Tyr Asp Asn Leu Pro Leu Thr Phe 
85 90 95 

Gly Gly Gly Thr Lys Val Glu lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg 
15 10 15 

lie Thr lie Thr Cys Arg Ala Ser Gin Thr lie Asn Asn Tyr Leu Asn 
20 25 30 

Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr Gly 
35 40 45 

Ala Ser Asn Leu Gin Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 
50 55 60 

Ser Gly Thr Asp Fhe Thr Leu Thr lie Ser Ser Leu Gin Pro Glu Asp 
65 70 75 80 

Phe Ala Thr Tyr Phe Cys Gin Gin Ser Tyr Asn Thr Pro Pro Trp Thr 
85 90 95 
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Fhe Gly Gin Gly Thr Lys Val Glu lie Lys Arg Thr 

100 105 V / 

(2) INFORMATION FOR SEQ ID NO:110: , V , V V ,/ 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids ;' :\v-y'.- ■ V .\V, 

(B) TYPE: amino acid ' v 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:110: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
1 5 10 15 

Ala Thr Leu Ser Cys Arg Ala Ser Gin Arg Val Asn Ser Asn Tyr Leu & 
20 25 30 

Ala Trp Tyr Gin Gin Lys Pro Gly Gin Thr Pro Arg Val Val lie Tyr V 

35 40 ,45 : 

Ser Thr Ser Arg Arg Ala Thr Gly Val Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Arg Leu Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Val Tyr Tyr Cys Gin Gin Phe Gly Asp Ala Gin Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Glu lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO:lll: / WU? ' A 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE; amino acid 

(D) TOPOLOGY: linear V. 

(ii) MOLECULE TYPE: protein '.'■"%>[ 



I ■<' ■ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 



Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Arg Val Asn Ser Asn 
1 5 10 15 
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Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Thr Pro Arg Val Val 
20 25 30 

lie Tyr Ser Thr Ser Arg Arg Ala Thr Gly Val Pro Asp Arg Phe Ser 
35 40 45 

Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Arg Leu Glu 
50 55 60 

Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Phe Gly Asp Ala Gin 
65 70 75 80 

Tyr Thr Phe Gly Gin Gly Thr Lys Leu Glu lie Lys Arg 
85 90 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Thr Gin Ser Pro Ser Ser Val Ser Ala Ser Val Gly Asp Thr Val Thr 
1 5 10 15 

Phe Thr Cys Arg Ala Ser Gin Asp lie Arg Asn Tyr Leu Asn Trp Tyr 
20 25 30 

His Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Ser Asp Ala Ser 
35 40 45 

Asp Leu Glu He Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Ala 
50 55 60 

Thr Tyr Phe Ser Phe Thr He Ser Ser Leu Gin Pro Glu Asp He Gly 
65 70 75 80 

— Thr Tyr Tyr Cys Gin Gin Tyr Ala Asp Leu He Thr Phe Gly Gly Gly 
85 90 95 

Thr Lys Val Glu He Lys Arg Thr 
100 

(2) INFORMATION FOR SEQ ID NO: 113: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Ser Pro Gly Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser Val 
15 10 15 

Gly Thr Asn Leu Ala Tip Tyr Gin Gin Lys Fro Gly Gin Ala Pro Arg 
20 25 30 

Leu Leu lie Phe Asp Ala Ser Thr Arg Asp Thr Tyr lie Pro Asp Thr 
35 40 45 

Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Ala Leu Thr lie Ser Ser 
50 55 60 

Leu Gin Ser Glu Asp Phe Gly Phe Tyr Tyr Cys Gin Gin Tyr Asp Asn 
65 70 75 80 

Trp Pro Pro Thr Phe Gly Gin Gly Thr Lys Leu Glu Val Lys Arg Thr 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Asp Arg 
15 10 15 

Ala Thr Phe Ser Cys Arg Ser Ser His Asn lie Arg Ser Arg Arg Val 
20 25 30 

Ala Trp Tyr Gin His Lys Pro Gly Gin Ala Pro Arg Leu Val He His 
35 40 45 

Gly Val Ser Asn Arg Ala Ser Gly He Ser Asp Arg Phe Ser Gly Ser 
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50 55 60 

Gly Ser Gly Thr Asp Phe Thr Leu Thr He Thr Arg Leu Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Leu Tyr Tyr Cys Gin Val Tyr Gly Ala Ser Ser Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Asp Phe Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly Glu Arg 
1 5 10 15 

Ala Thr Phe Ser Cys Arg Ser Ser His Asn He Arg Ser Arg Arg Val 
20 25 30 

Ala Trp Tyr Gin His Lys Pro Gly Gin Ala Pro Arg L*u Val He His 
35 40 45 

Gly Val Ser Asn Arg Ala Thr Gly He Ser Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Phe Thr Leu Thr He Thr Arg Leu Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Leu Tyr Tyr Cys Gin Val Tyr Gly Ala Ser Ser Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Asp Phe Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO:116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



PCT/US93/09328 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Glu Leu Thr Gin Ser Pro Asp Thr Leu Ser Leu Asn Val Gly Glu Arg 
1 5 10 15 

Ala Thr Leu Ser Cys Arg Ala Ser His Arg lie Ser Ser Arg Arg Leu 
20 25 30 

Ala Trp Tyr Gin His Lys Arg Gly Gin Ala Pro Arg Leu Leu lie Tyr 
35 40 45 

Gly Val Ser Ser Arg Ala Gly Gly Val Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Gly Thr Asp Phe Ser Leu Thr lie Ser Arg Leu Glu Pro Glu 
65 ,70 75 80 

Asp Phe Ala Met Tyr Tyr Cys Gin Thr Tyr Gly Gly Ser Ser Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Val Asp lie Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID N0:117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Glu Leu Thr Gin Ser Pro Asp Thr Leu Ser Leu Asn Ala Gly Glu Arg 
1 5 10 15 

Ala Thr Leu Ser Cys Arg Ala Ser His Arg lie Ser Ser Arg Arg Leu 
20 25 30 

Ala Trp Tyr Gin His Lys Arg Gly Gin Ala Pro Arg Leu Leu lie Tyr 
35 40 45 

Gly Val Ser Asn Arg Ala Gly Gly Val Pro Asp Arg Phe Ser Gly Ser 
50 55 60 
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Gly Ser Gly Thr Asp Phe Ser Leu Thr lie Ser Arg Leu Glu Pro Glu 

65 70 ";V 75 i "V 80 

Asp Phe Ala He Tyr Tyr Cys Gin Thr Tyr Gly Gly Ser Ser Tyr Thr \ 

85 :];;■.? ;90 jv .' % 95 

Phe Gly Gin Gly Thr Thr Val Asp He Lys Arg , 

100 105;; ;V 

(2) INFORMATION FOR SEQ ID N0:118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



Glu 
1 


Leu 


Thr Gin 


Ser 
5 


Pro 


Asp 


Thr Leu Ser Leu Asn Thr Gly Glu Arg 
10 15 


Ala 


Thr 


Leu Ser 
20 


Cys 


Arg 


Ala 


Ser His Arg lie Gly Ser Arg Arg Leu 
25 30 


Ala 


Trp 


Tyr Gin 
35 


His 


Arg 


Arg 


Gly Gin Ala Pro Arg Leu Leu lie Tyr 
40 45 


Gly 


Val 
50 


Ser Asn 


Arg 


Ala 


Gly 
55 


Gly Val Pro Asp Arg Phe Ser Gly Ser 
60 


Gly 
65 


Ser 


Gly Thr 


Asp 


Phe 
70 


Thr 


Leu Thr He Ser Arg Leu Glu Pro Glu 
75 80 


Asp 


Phe 


Ala He 


Tyr 
85 


Tyr 


cys 


Gin Thr Tyr Gly Gly Ser Ser Tyr Thr 
90 95 


Phe 


Gly 


Gin Gly 
100 


Thr 


Lys 


Val 


Asp He Lys Arg 

• 105 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear V 

(ii) MOLECULE TYPE: protein 7 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Thr Pro Gly Glu Arg 
1 5 10 15 

Ala lie Leu Ser Cys Lys Thr Ser His Asn lie Trp Ser Arg Arg Leu 
20 25 30 

Ala Trp Tyr Gin Leu Lys Ser Gly Gin Ala Pro Arg Leu Leu lie Tyr 
35 AO 45 

Gly Val Ser Lys Arg Ala Gly Gly lie Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Ala Thr Asp Phe Thr Leu Thr lie Ser Arg Val Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Val Tyr Tyr Cys Gin Thr Tyr Gly Gly Ser Ala Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Asp lie Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO; 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Thr Pro Gly Glu Arg 
1 5 10 15 

Ala lie Leu Ser Cys Lys Thr Ser His Asn lie Trp Ser Arg Arg Leu 
20 25 30 

Ala Trp Tyr Gin Leu Lys Ser Gly Gin Ala Pro Arg Leu Leu lie Tyr 

< ; 35 40 45 

Gly Val Ser Lys Arg Ala Gly Gly lie Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Ala Thr Asp Phe Thr Leu Thr lie Ser Arg Val Glu Pro Glu 
65 70 75 80 
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Asp Phe Ala Val Tyr Tyr Cys Gin Thr Tyr Gly Gly Ser Ala Tyr Thr 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Glu lie Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Ser Thr Pro Gly Glu Arg 
15 10 15 

Ala lie Leu Ser Cys Lys Thr Ser His Asn lie Trp Ser Arg Arg Leu 
20 25 30 

Ala Trp Tyr Gin Val Lys Ser Gly Leu Pro Pro Arg Leu Leu lie His 
35 40 45 

Gly Val Ser Arg Arg Ala Gly Gly lie Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Ala Arg Asp Hie Thr Leu Thr lie Ser Arg Leu Glu Pro Ala 
65 70 75 80 

Asp Phe Ala Val Tyr Tyr Cys Gin Thr Tyr Gly Gly Ser Ser Tyr Ser 
85 90 95 

Phe Gly Gin Gly Thr Lys Leu Asp Phe Asn Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
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Glu Leu Thr Gin Ser Fro Gly Thr Leu Ser Leu Asn Pro Gly Glu Arg 
1 5 10 15 

Ala Val Leu Ser Cys Arg Thr Ser Arg Asn lie Trp Ser Arg Arg Leu 
20 25 30 

Ala Trp Tyr Gin Val Arg Arg Gly Gin Ala Pro Arg Leu Leu lie His 
35 40 45 

Gly Val Ser Lys Arg Ala Gly Gly Val Pro Asp Arg Phe Ser Gly Ser 
50 55 60 

Gly Ser Ala Arg Asp Phe Thr Leu Thr lie Ser Arg Leu Glu Pro Glu 
65 70 75 80 

Asp Phe Ala Val Tyr Phe Cys Gin Thr Tyr Gly Gly Ser Ser Tyr Thr 
85 90 95 

Phe Gly Gin Gly Asn Lys Leu Asp lie Arg Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Gin Val Lys Leu 
1 

Ala Ser Val Lys 
20 

Phe Val Leu His 
35 

Met Gly Trp He 
50 

Phe Gin Asp Arg 
65 

Tyr Leu Glu Leu 



Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly 
5 10 15 

Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn 
25 30 

Trp Ala Arg Gin Ala Pro Gly His Arg Pro Glu Trp 
40 45 

Asn Pro Ala Asn Gly Val Thr Glu He Pro Pro Lys 
55 60 

Val Ser Leu Thr Arg Asp Thr Ser Ala Gly Thr Val 
70 75 80 



Thr Asn Leu Arg Phe Ala Asp Thr Ala Val Tyr Tyr 
85 90 95 
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Cys Ala Arg Val Gly Glu Trp Thr Trp Asp Asp Ser Pro Gin Asp Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val Thr Val 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

. Gin Val Lys Leu Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly 
15 10 15 

Ala Ser Val Lys Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn 
20 25 30 

Phe Val Leu His Trp Ala Arg Gin Ala Pro Gly His Arg Pro Glu Trp 
35 40 45 

Met Gly Trp lie Asn Pro Ala Asn Gly Val Thr Glu lie Ser Pro Lys 
50 55 60 

Phe Gin Asp Arg Val Ser Leu Thr Gly Asp Thr Ser Ala Ser Thr Val 
65 70 75 80 

Tyr Leu Glu Leu Arg Asn Leu Arg Phe Ala Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Glu Trp Thr Trp Asp Asp Ser Pro Gin Asp Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Arg Gly Thr Thr Val Thr 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Gin Val Lys Leu Leu Glu Gin Ser Gly Ma Glu Val Lys Lys Pro Gly 
1 5 10 15 

Ala Ser Val Lys Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn 
20 25 30 

Phe Val Leu His Trp Ala Arg Gin Ala Pro Gly His Arg Pro Glu Trp 
35 40 45 

Met Gly Trp lie Asn Pro Ala Asn Gly Val Thr Glu lie Ser Pro Lys 
50 55 60 

Phe Gin Asp Arg Val Ser Leu Thr Gly Asp Thr Ser Ala Ser Thr Val 
65 70 75 80 

Tyr Leu Glu Leu Arg Ser Leu Arg Phe Ala Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Glu Trp Thr Trp Asp Asp Ser Pro Gin Asp Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val 
115 120 

(2) INFORMATION FOR SEQ ID NO: 126: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Gin Val Lys Leu Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly 
1 5 10 15 

Ala Ser Val Lys lie Ser Cys Gin Ala Ser Gly Tyr Arg Phe Thr Asn 
20 25 30 

Phe Val Leu His Trp Ala Arg Gin Ala Pro Gly Gin Arg Pro Glu Trp 
35 40 45 

Met Gly Trp Phe Asn Pro Ala Asn Gly lie Lys Glu lie Ser Pro Lys 
50 55 60 
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Phe Gin Asp Arg Val Ser Phe Thr Gly Asp Thr Ser Ala Ser Thr Ala 
65 70 75 80 

Tyr Val Glu Leu Arg Asn Leu Arg Ser Ala Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Pro Trp Thr Trp Asp Asp Ser Pro Gin Asp Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val 
115 120 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 

Gin Val Lys Leu Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly 
1 5 10 15 

Ala Ser Val Lys Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn 
20 25 30 

Phe Val Leu His Trp Ala Arg Gin Ala Pro Gly His Arg Pro Glu Trp 
35 40 45 

Met Gly Trp lie Asn Pro Ala Asn Gly Val Thr Glu lie Ser Pro Lys 
50 55 60 

Phe Gin Asp Arg Val Ser Leu Thr Gly Asp Thr Ser Ala Ser Thr Val 
65 70 75 80 

Tyr Leu Glu Leu Arg Asn Leu Arg Phe Ala Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Glu Trp Thr Trp Asp Asp Phe Pro Gin Asp Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val 
115 120 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

Gin Val Lys Leu Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly 
15 10 15 

Ala Ser Val Lys Leu Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn 
20 25 30 

Phe Val Leu His Trp Ala Arg Gin Ala Pro Gly His Arg Pro Glu Trp 
35 40 45 

Met Gly Trp lie Asn Pro Ala Asn Gly Val Thr Glu lie Ser Pro Lys 
50 55 60 

Phe Gin Asp Arg Val Ser Leu Thr Gly Asp Thr Ser Ala Ser Thr Val 
65 70 75 80 

Tyr Leu Glu Leu Arg Asn Leu Arg Phe Ala Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Glu Trp Thr Trp Asp Asp Ser Pro Gin Asp Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val Thr 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Gin Val Lys Leu Leu Glu Gin Ser Gly Thr Glu Val Lys Lys Pro Gly 
1 5 10 15 

Ala Ser Val Lys lie Ser Cys Lys Ala Ser Gly Tyr Arg Phe Thr Asn 
20 25 30 



v 
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Phe Pro Leu His Trp Val Arg Gin Ala Pro Gly Gin Arg Pro Glu Trp 
35 40 45 

Met Gly Trp lie Lys lie Val Asn Gly Glu Lys Lys Tyr Ser Gin Lys 
50 55 60 

Phe Val Asp Arg Val Thr Phe Thr Gly Asp Thr Ser Ala Asn Thr Ala 
65 70 75 80 

Tyr Met Glu Val Arg Gly Leu Arg Ser Ala Asp Thr Ala Thr Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Glu Trp Thr Trp Asp Met Asp Pro Gin Ala Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val Thr 
115 . 120 125 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:130: ; 

. ,■ j . 

Gin Val Lys Leu Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly 
1 5 10 15 

Ala Ser Val Lys Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn 
20 25 30 

Phe Val He His Trp Val Arg Gin Ala Pro Gly Gin Arg Phe Glu Trp 
35 40 45 

Met Gly Trp He Asn Fro Tyr Asn Gly Asn Lys Glu Phe Ser Ala Lys 
50 55 60 

Phe Arg Asp Arg Val Thr Phe Thr Ala Asp Thr Asp Ala Asn Thr Ala 
65 70 75 80 

Tyr Met Glu Leu Arg Ser Leu Arg Ser Ala Asp Thr Ala He Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Pro Tyr Thr Trp Asp Asp Ser Pro Gin Asp Asn 
100 , 105 110 
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Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val 
115 120 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Gin Val Lys Leu Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly 
1 5 10 15 

Ala Ser Val Lys Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn 
20 25 30 

Phe Val Leu His Trp Ala Arg Gin Ala Pro Thr Gin Asp Leu Glu Trp 
35 40 45 

Met Gly Trp lie Asn Pro Ala Asn Gly Val Lys Glu lie Ser Pro Lys 
50 55 60 

Phe Gin Asp Arg Val Ser Leu Thr Gly Asp Thr Ser Ala Ser Thr Val 
65 70 75 80 

Tyr Leu Glu Leu Arg Ser Leu Arg Phe Ala Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Glu Trp Thr Trp Asp Asp Ser Pro Gin Asp Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val 
115 120 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
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Gin Val Lys Leu Leu Glu Gin Ser Gly Ala Glu Val Lys Lys Pro Gly 
15 10 15 

Ala Ser Val Lys Val Ser Cys Gin Ala Ser Gly Tyr Arg Phe Ser Asn 
20 25 30 

Phe Val Leu His Trp Ala Arg Gin Ala Pro Gly His Arg Pro Glu Trp 
35 40 45 

Met Gly Trp lie Asn Pro Ala Asn Gly Val Thr Glu lie Pro Pro Lys 
50 55 60 

Phe Gin Asp Arg Val Ser Leu Thr Arg Asp Thr Ser Ala Gly Thr Val 
65 70 75 80 

Tyr Leu Glu Leu Thr Asn Leu Arg Phe Ala Asp Thr Ala Val Tyr Tyr 
85 90 95 

Cys Ala Arg Val Gly Glu Trp Thr Trp Asp Asp Ser Pro Gin Asp Asn 
100 105 110 

Tyr Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val 
115 120 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: ; 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 
TCGAGGGTCG GTCGGTCTCT AGACGGTCGG TCGGTCA 37 
(2) INFORMATION FOR SEQ ID N0:134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
CTAGTGACCG ACCGACCGTC TAGAGACCGA CCGACCC 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 
CGGTCGGTGG GTCGTCGAGG GTCGGTGGGT GT 
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 
CTAGAGACCG ACCGACCGTC GAGGACCGAC CGACCGAGCT 
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(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
CAAGGAGACA GGATCCATGA AATAC 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
AGGGCGAATT GGATCCCGGG CCCCC 
(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
CTAGTGATGA TGATGATGAT TAA6CTAGC 2? 
(2) INFORMATION FOR SEQ ID NO: 140: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
CTAGGCTAGC TTAATGATGA TGATGATGA 29 
(2) INFORMATION FOR SEQ ID N0:141: 

(i) SEQUENCE CHARACTERISTICS: > 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal , f 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label- J 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label- ZC 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
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Ser He Ser He Gly Pro Gly Arg Ala Phe Tyr Thr Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Leu Leu Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Glu Thr Leu Ser 
1 5 10 15 

Leu Thr Cys Thr Val Ser Gly Gly Ser Leu Ser Ser Phe Asp Trp Asn 
20 25 30 

Trp He Arg Gin Pro Ala Gly Lys Gly Leu Glu Trp He Gly Arg He 
35 40 45 

Tyr Pro Ser Gly Asn Thr His Tyr Asn Pro Ser Leu Arg Ser Arg Val 
50 55 60 

Thr Met Ser Arg Asp Thr Ser Lys Asn Gin Phe Ser Val Lys Leu Thr 
65 70 75 80 

Ser Val Thr Ala Ala Asp Thr Ala Leu Tyr Tyr Cys Ala Arg Glu Asn 
85 90 95 

Thr Gly Arg Thr He Glu Glu He Gly Asn Phe Phe Asp He Trp Gly 
100 105 110 

Gin Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
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Leu Leu Lys Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg 
15 10 15 

Leu Ser Cys Val He Ser Ala Phe Ser Phe Ser Gly Tyr Asn He Asn 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ser Ser He 
35 40 45 

Ser Met Ser Thr Gly Ser Leu Ser Tyr Ala Asp Ser Met Lys Gly Arg 
50 55 60 

Phe Thr He Ser Arg Asp Asn Ala Lys Asn Ser Val Tyr Leu Glu Met 
65 70 75 80 

Ser Ser Leu Thr Ala Glu Asp Thr Ala Met Tyr Tyr Cys Ala Ala Arg 
85 90 95 

Thr Pro Leu Val Gly Arg Ala Leu Asp He Trp Gly Gin Gly Thr Val 
100 105 }10 

Val Thr Val Ser Ser Ala Ser Thr Lys Gly 
115 120 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

Leu Leu Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly Ser Leu Arg 
1 5 10 15 

Leu Ser Cys Ser Ala Ser Gly Phe Thr Phe Ser Ser Tyr Gly Met Asn 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Pro Glu Trp Val Ala Tyr He 
35 40 45 

Ser Ser Ser Arg Lys Tyr Thr Glu Tyr Ala Asp Ser Val Lys Gly Arg 
50 55 60 

Phe Thr He Ser Arg Glu Asn Ala Lys Tyr Ser Val Phe Leu Gin Leu 
65 70 75 80 
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Asp Ser Leu Thr Ala Glu Asp Thr Ala He Tyr Tyr Cys Ala Arg Gly . 

85 90 " 95 mV 

Arg Asp Phe Tyr Ser Gly Phe Gly Arg Arg Asp Asp Fhe His Leu His 
100 105 : 110 

Tyr Met Asp Val Trp Gly Lys Gly Thr Thr Val Thr Val Ser Ser Ala 
115 120 125 

Ser Thr Lys Gly 
130 

(2) INFORMATION FOR SEQ ID N0:145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid . 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein .'/^ .V V :V 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

Gly Leu Val Gin Pro Gly Gly Ser Leu 

f; 10 15 

Gly Asp lie Fhe Tyr Ser Tyr Ala Met 
25 30 

Gly Lys Gly Leu Glu Trp Val Ala Ser 
40 45 

Asn Tyr Tyr Ala Asp Ser Val Lys Gly 
60 

Asn Ser Lys Ser Thr Leu Tyr Leu Gin 

75 . 80 't 

Asp Thr Ala Leu Tyr Tyr Cys Ala Arg 
90 95 

lie Arg Gly Trp Phe Asp Ser Trp Gly 
105 110 

Ser Ser Ala Ser Thr Lys Gly 
120 125 

(2) INFORMATION FOR SEQ ID N0:146: 

(i) SEQUENCE CHARACTERISTICS: 



Leu Leu Glu Gin Ser Gly Gly 
1 5 

Arg lie Ser Cys Val Ala Ser 
20 

Ser Trp Val Arg Gin Ala Pro 
35 



lie Ser Gly Thr Gly Gly Ser 
50 55 



Arg Phe Thr lie Ser Arg Asp 
65 70 

Met Asn Ser Leu Arg Ala Glu 
85 



Asp Arg Gly Pro Arg lie Gly 
100 



Gin Gly Thr Leu Val Thr Val 
115 
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(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

Leu Leu Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg 
15 10 15 

Leu Ser Cys Ala Ala Ser Gly Phe Leu Tyr Ser Ser Fhe Ala Met Ser 
20 25 30 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Ala Trp Val Ser Thr lie 
35 40 45 

Ser Ala Ser Gly Gly Ser Thr Lys Tyr Ala Asp Ser Val Lys Gly Arg 
50 55 60 

Phe lie lie Ser Arg Asp Asn Ser Lys Asn Thr lie Tyr Leu Gin Met 
65 70 75 80 

Asp Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Lys Asn 
85 90 95 

Phe Arg Ala Fhe Ala Arg Asp Pro Trp Gly Asp Trp Gly Gin Gly Thr 
100 105 110 

Leu Val Thr Val Ser Ser Ala Ser Ala Ser Thr Lys 
115 120 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

Met Ala Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly 
15 10 15 

Glu Arg Val lie Val Ser Cys Arg Ala Ser Gin Ser Val Ser Ser Asn 
20 25 30 
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Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu 
35 40 45 

lie Tyr Gly Ala Ser Asn Arg Ala Thr Gly lie Pro Asp Arg Phe Ser 
50 55 60 

Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Arg Leu Glu 
65 70 75 80 

Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Gly Ser Ser Gly 
85 90 95 

Thr Phe Gly Gin Gly Thr Lys Val Glu lie Lys Arg Thr 
100 105 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Met Ala Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly 
1 5 10 15 

Glu Arg Ala Thr Phe Ser Cys Arg Ser Ser His Ser lie His Thr Arg 
20 25 30 

Arg Val Ala Trp Tyr Gin His Lys Pro Gly Gin Ala Pro Arg Leu Val 
35 40 45 

lie His Gly Val Ser Asn Arg Ala Ser Gly lie Ser Asp Arg Phe Ser 
50 55 ,60 

Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Thr Arg Val Glu 
65 70 75 80 

Pro Glu Asp Phe Ala Leu Tyr Tyr Cys Gin Val Tyr Gly Ala Ser Ser 
85 90 95 

Tyr Thr Phe Gly Gin Gly Thr Lys Leu Glu Arg Lys Arg Thr Val Val 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 149: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

Met Ala Glu Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly 
1 5 10 15 

Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Ser Val Ser Asn Gly 
20 25 30 

Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu 
35 40 45 

lie Tyr Gly Ala Ser Thr Arg Ala Thr Asp lie Pro Asp Arg Phe Ser 
50 55 60 

Gly Ser Gly Ser Gly Ala Asp Phe Thr Leu Ala He Ser Arg Leu Glu 
65 70 75 80 

Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Ala Gly Ser His 
85 90 95 

Thr Phe Gly Gin Gly Thr Lys Leu Glu He Lys Arg Thr Val Ala 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Met Ala Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
15 10 15 

Asp Arg Val Thr He Thr Cys Arg Pro Ser Gin Gly He Gly Arg Phe 
20 25 30 

Phe Asn Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Asn Leu Leu He 
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35 40 45 

Tyr Ala Ala Asp lie Leu Gin Ser Gly Val Pro Ser Arg Phe Ser Gly 
50 55 60 

Ser Gly Ser Gly Thr Asp Hie Thr Leu Thr lie Ser Ser Leu Gin Pro 

65 70 75 . , . 80 

Glu Asp Phe Ala Thr Tyr Tyr Cys Gin Gin Ser Tyr Ser Thr Pro Tyr 
35 90 95 

Thr Phe Gly Gin Gly Thr Arg Leu Asp lie Lys Arg Thr Val Ala 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

Met Ala Glu Leu Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
1 5 10 15 

Asp Arg Val Thr lie Thr Cys Arg Ala Ser Gin Gly Val Ser Ser Ser 
20 25 30 

Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Val 
35 40 45 

lie Phe Gly Ala Tyr Ser Arg Ala Thr Gly lie Pro Asp Arg Phe Ser 
50 55 60 

Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Arg Leu Glu 
65 70 75 80 

Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Gly Ser Ser Pro 
85 90 95 

lie Thr Phe Gly Pro Gly Thr Lys Val Asp lie Lys Arg Thr Val Ala 
100 105 110 
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What Is Claimed Is: ■ 

1. A human monoclonal antibody capable of 
immunoreacting with human immunodeficiency virus (HIV) 
glycoprotein gpl20 and neutralizing HIV, wherein the 

5 monoclonal antibody has the binding specificity of a:; 
monoclonal antibody comprising a heavy chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID 
NOs 66, 67, 68, 70, 72, 73, 74, 75, 78 and 97, 
10 and conservative substitutions thereof. 

2. The human monoclonal antibody of claim l 
wherein the monoclonal antibody has the binding 
specificity of a monoclonal antibody having heavy and 
light chain immunoglobulin variable region amino acid:: ^ 

15 residue sequences in pairs selected from the group 

consisting of SEQ ID NOs 66:95, 67:96, 72:102, 66:97, 
73:107, 74:103, 70:101, 68:98, 75:104, 72:105, 78:110, 
66:118, 66:122, 66:121, 66:115, 97:124, 97:132 and 
66:98, and conservative substitutions thereof. 

20 3. The human monoclonal antibody of claim 1, 

wherein the monoclonal antibody has the binding 
specificity of a monoclonal antibody produced by ATCC 
69078, ATCC 69079 or ATCC 69080. 

4. The human monoclonal antibody of claim 3, 
25 wherein the monoclonal antibody is the monoclonal 

antibody produced by ATCC 69078, ATCC 69079 or ATCC • 
69080. 

5. A human monoclonal antibody capable of 
immunoreacting with human immunodeficiency virus (HIV) 

30 glycoprotein gpl20 and neutralizing HIV, wherein the 
monoclonal antibody has the binding specificity of a 
monoclonal antibody comprising a light chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID ■ 

35 NOS 95, 96, 97, 98, 101, 102, 103, 104, 105, 107, 110, 
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115, 118 , 121 , 122, 124 and 132, and conservative 
substitutions thereof* 

6. A polynucleotide sequence encoding a heavy 
chain immunoglobulin variable region amino acid 

5 residue sequence portion of a human monoclonal 
antibody capable of immunoreacting with human 
immunodeficiency virus (HIV) glycoprotein gp!20 and 
neutralizing HIV, wherein the monoclonal antibody has 
the binding specificity of a monoclonal antibody 

10 comprising said heavy chain immunoglobulin variable 
region amino acid residue sequence selected from the 
group consisting of SEQ ID NOs 66, 67, 68, 70, 72, 73, 
74, 75, 78 and 97, and conservative substitutions of 
the amino acid residue sequence, and polynucleotide 

15 sequences complementary thereto. 

7. The polynucleotide sequence of claim 6, 
wherein the polynucleotide is DNA. 

8. A polynucleotide sequence encoding a light 
chain immunoglobulin variable region amino acid 

20 residue sequence portion of a human monoclonal 
antibody capable of immunoreacting with human 
immunodeficiency virus (HIV) glycoprotein gpl20 and 
neutralizing HIV, wherein the monoclonal antibody has 
the binding specificity of a monoclonal antibody 

25 comprising said light chain immunoglobulin variable 
region amino acid residue sequence selected from the 
group consisting of SEQ ID NOs 95, 96, 97, 98, 101, 
102, 103, 104, 105, 107, 110, 115, 118, 121, 122, 124 
and 132, and conservative substitutions of the amino 

30 acid residue sequence, and polynucleotide sequences 
complementary thereto. 

9. A human monoclonal antibody capable of 
immunoreacting with human immunodeficiency virus (HIV) 
glycoprotein gp41 and neutralizing HIV, wherein the 

35 monoclonal antibody has the binding specificity of a 
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monoclonal antibody comprising a heavy chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID 
NOs 142, 143 r 144, 145 and 146, and conservative 
substitutions thereof. 

10 . The human monoclonal antibody of claim 9 
wherein the monoclonal antibody has the binding 
specificity of a monoclonal antibody having heavy and 
light chain immunoglobulin variable region amino acid 
residue sequences in pairs selected from the group 
consisting of SEQ ID NOs 142:147, 143:148, 144:149, 
145:150, and 146:151, and conservative substitutions 
thereof. 

11. A human monoclonal antibody capable of 
immunor eact ing with human immunodeficiency virus (HIV) 
glycoprotein gp41 and neutralizing HIV, wherein the 
monoclonal antibody has the binding specificity of a 
monoclonal antibody comprising a light chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID 
NOs 147, 148, 149, 150, and 151 and conservative 
substitutions thereof. 

12. A polynucleotide sequence encoding a heavy 
chain immunoglobulin variable region amino acid 
residue sequence portion of a human monoclonal 
antibody capable of immunoreacting with human 
immunodeficiency virus (HIV) glycoprotein gp41 and 
neutralizing HIV, wherein the monoclonal antibody has 
the binding specificity of a monoclonal antibody 
comprising said heavy chain immunoglobulin variable 
region amino acid residue sequence selected from the 
group consisting of SEQ ID NOs 142, 143, 144, 145, and 
146, and conservative substitutions of the amino acid 
residue sequence, and polynucleotide sequences 
complementary thereto. 
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13. The polynucleotide sequence of claim 12, 
wherein the polynucleotide is DNA. 

14 * A polynucleotide sequence encoding a light 
chain immunoglobulin variable region amino acid 
residue sequence portion of a human monoclonal 
antibody capable of immunoreacting with human 
immunodeficiency virus (HIV) glycoprotein gp41 and 
neutralizing HIV, wherein the monoclonal antibody has 
the binding specificity of a monoclonal antibody 
comprising said light chain immunoglobulin variable 
region amino acid residue sequence selected from the 
group consisting of SEQ ID NOs 147, 148, 149, 150, and 
151, and conservative substitutions of the amino acid 
residue sequence, and polynucleotide sequences 
complementary thereto. 

15. A host cell comprising the polynucleotide 
sequence of claim 6, 8, 12 or 14. 

16. A DNA expression vector comprising the 
polynucleotide sequence of claim 6,8, 12 or 14 . 

17. A method of detecting human immunodeficiency 
virus (HIV) comprising contacting a sample suspected 
of containing HIV with a diagnostically effective 
amount of the monoclonal antibody of claim 1, 5, 9 or 
11 and determining whether the monoclonal antibody 
immunoreacts with the sample. 

18. The method of claim 17, wherein the 
detecting is in vivo . 

19. The method of claim 18, 'wherein the 
monoclonal antibody is detect ably labelled with a 
label selected from the group consisting of a 
radioisotope and a paramagnetic label. 

20. The method of claim 17, wherein the 
detecting is in vitro . 

21 i* The method of claim 20, wherein the 
monoclonal antibody is detectably labelled with a 
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label selected from the group consisting of a 
radioisotope, a fluorescent compound, a colloidal 
metal, a chemiluminescent compound, a bioluminescent 
compound, and an enzyme. 

22. The method of claim 20, wherein the 
monoclonal antibody is bound to a solid phase* 

23. A method for providing passive immunotherapy 
to human immunodeficiency virus (HIV) disease in a 
human, comprising administering to the human an 
immunotherapeutically effective amount of the 
monoclonal antibody of claim l, 5, 9 or 11. 

24. The method of claim 23, wherein the passive 
immunotherapy is provided prophylactically. 

25. The method of claim 23, wherein the 
administering is parenteral administration. 

26. The method of claim 25, wherein the 
parenteral administration is by subcutaneous, 
intramuscular , intraperitoneal, intracavity, 
transdermal, or intravenous injection. 

27. The method of claim 25, wherein the 
parenteral administration is by gradual perfusion. 

28. The method of claim 27, wherein the gradual 
perfusion is by intravenous or peristaltic means. 

29. The method of claim 25, wherein the 
immunotherapeutically effective amount is from about 
0.1 mg/kg to about 300 mg/kg. 

30. A method for inducing active immunotherapy 
to human immunodeficiency virus (HIV) disease in a 
human which comprises administering to the human an 
immunogenically effective amount of an anti-idiotype 
antibody to the monoclonal antibody of claim 1, 5, 9 
or 11. 

31. A pharmaceutical composition comprising at 
least one dose of an immunotherapeutically effective 
amount of the monoclonal antibody of claim 1, 5, 9 or 
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11 in a pharmacological carrier. ' 

32. The pharmaceutical composition of claim 31 M 
wherein said composition contains two or more 
different monoclonal antibodies. \ 
5 33. A kit useful for the detection of human 

immunodeficiency virus (HIV) in a source suspected of 
containing HIV, the kit comprising carrier means being 
compartmentalized to receive in close confinement 
therein one or more containers comprising a container 
10 containing the monoclonal antibody of claim 1, 5, 9 or 
11. 
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: SHINE-DALGARNO MET 

GGCCGCAAATTCTATTTCAAGGAGACAGTCATAATG 
CGTTTAAGATAAAGTTCCTCTGTCAGTATTAC 



LEADER SEQUENCE 

'AAATACCTATTGCCTACGGCAGCCGCT 
TTTATGGATAACGGATGCCGTCGGCGA 

LEADER SEQUENCE 

GGATTGTTATTACTCGCTGCCCAACCAG 
CCTAACAATAATGAGCGACGGGTTGGTC 

LINKER LINKER 
V H BACKBONE 



NCOI 



XHOI SPEI 



ccatgg'cccaggtgaaactg'ctcgagatttceagactagt' 
ggtaccgggtccactttgacgagctctaaagatctgatca 

' , stop linker 

'TyrProTyrAspValProAspTyrAlaSer' , 

TACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTCG 
ATGGGCATGCTGCAAGGCCTGATGCCAAGAATTATCTTAAGCAGCT 



Figure 1 
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